Chapter One


ELECTRIC CHARGES AND 
FIELDS 


1.1 Introduction 

All of us have the experience of seeing a spark or hearing a crackle when 
we take off our synthetic clothes or sweater, particularly in dry weather. 
This is almost inevitable with ladies' garments like a polyester saree.
Have you ever tried to find any explanation for this phenomenon? 
Another common example of electric discharge is the lightning that we 
see in the sky during thunderstorms. We also experience a sensation of 
an electric shock either while opening the door of a car or holding the iron 
bar of a bus after sliding from our seat. The reason for these experiences 
is discharge of electric charges through our body, which were 
accumulated due to rubbing of insulating surfaces. You might have also 
heard that this is due to generation of static electricity. This is precisely 
the topic we are going to discuss in this and the next chapter. Static 
means anything that does not move or change with time. Electrostatics 
deals with the study of forces, fields and potentials arising from static 
charges. 

1.2 Electric Charge 

Historically the credit of discovery of the fact that amber rubbed with wool 
or silk cloth attracts light objects goes to Thales of Miletus, Greece, 
around 600 BC. The name electricity is coined from the Greek word 
elektron meaning amber. Many such pairs of materials were known which
on rubbing could attract light objects like straw, pith balls and bits of
papers. You can perform the following activity at home to experience 
such an effect. Cut out long thin strips of white paper and lightly iron 
them. Take them near a TV screen or computer monitor. You will see that 
the strips get attracted to the screen. In fact they remain stuck to the 
screen for a while. 






Figure 1.1 Rods and pith balls: like charges repel and unlike charges attract each other. 
Interactive animation on simple electrostatic experiments:
http://ephysics.physics.ucla.edu/travoltage/HTML/staticElectricity.htm 


It was observed that if two glass rods rubbed with wool or silk cloth are
brought close to each other, they repel each other [Fig. 1.1(a)]. The two
strands of wool or two pieces of silk cloth, with which the rods were
rubbed, also repel each other. However, the glass rod and wool attract
each other. Similarly, two plastic rods rubbed with cat’s fur repel each
other [Fig. 1.1(b)] but attract the fur. On the other hand, the plastic rod
attracts the glass rod [Fig. 1.1(c)] and repels the silk or wool with which the
glass rod is rubbed. The glass rod repels the fur.

If a plastic rod rubbed with fur is made to touch two small pith balls (nowadays
we can use polystyrene balls) suspended by silk or nylon thread,
then the balls repel each other [Fig. 1.1(d)] and are also repelled by the
rod. A similar effect is found if the pith balls are touched with a glass rod
rubbed with silk [Fig. 1.1(e)]. A dramatic observation is that a pith ball
touched with glass rod attracts another pith ball touched with plastic rod
[Fig. 1.1(f)].

These seemingly simple facts were established from years of efforts and 
careful experiments and their analyses. It was concluded, after many 
careful studies by different scientists, that there were only two kinds of an 
entity which is called the electric charge. We say that the bodies like 
glass or plastic rods, silk, fur and pith balls are electrified. They acquire 
an electric charge on rubbing. The experiments on pith balls suggested 
that there are two kinds of electrification and we find that (i) like charges 
repel and (ii) unlike charges attract each other. The experiments also 
demonstrated that the charges are transferred from the rods to the pith 
balls on contact. It is said that the pith balls are electrified or are charged 
by contact. The property which differentiates the two kinds of charges is 
called the polarity of charge. 

When a glass rod is rubbed with silk, the rod acquires one kind of charge 
and the silk acquires the second kind of charge. This is true for any pair 
of objects that are rubbed to be electrified. Now if the electrified glass rod 



is brought in contact with silk, with which it was rubbed, they no longer 
attract each other. They also do not attract or repel other light objects as 
they did on being electrified. 

Thus, the charges acquired after rubbing are lost when the charged 
bodies are brought in contact. What can you conclude from these 
observations? It just tells us that unlike charges acquired by the objects 
neutralise or nullify each other’s effect. Therefore, the charges were 
named as positive and negative by the American scientist Benjamin 
Franklin. We know that when we add a positive number to a negative 
number of the same magnitude, the sum is zero. This might have been 
the philosophy in naming the charges as positive and negative. By 
convention, the charge on glass rod or cat’s fur is called positive and that 
on plastic rod or silk is termed negative. If an object possesses an 
electric charge, it is said to be electrified or charged. When it has no 
charge it is said to be electrically neutral. 


Unification of electricity and magnetism 

In olden days, electricity and magnetism were treated as separate 
subjects. Electricity dealt with charges on glass rods, cat’s fur, 
batteries, lightning, etc., while magnetism described interactions of 
magnets, iron filings, compass needles, etc. In 1820 Danish scientist 
Oersted found that a compass needle is deflected by passing an 
electric current through a wire placed near the needle. Ampere and 
Faraday supported this observation by saying that electric charges in 
motion produce magnetic fields and moving magnets generate 
electricity. The unification was achieved when the Scottish physicist 
Maxwell and the Dutch physicist Lorentz put forward a theory where 
they showed the interdependence of these two subjects. This field is 
called electromagnetism. Most of the phenomena occurring around
us can be described under electromagnetism. Virtually every force
that we can think of like friction, chemical force between atoms
holding the matter together, and even the forces describing
processes occurring in cells of living organisms, have their origin in
electromagnetic force. Electromagnetic force is one of the
fundamental forces of nature.

Maxwell put forth four equations that play the same role in classical 
electromagnetism as Newton’s equations of motion and gravitation 
law play in mechanics. He also argued that light is electromagnetic in 
nature and its speed can be found by making purely electric and 
magnetic measurements. He claimed that the science of optics is 
intimately related to that of electricity and magnetism. 

The science of electricity and magnetism is the foundation for the 
modern technological civilisation. Electric power, telecommunication, 
radio and television, and a wide variety of the practical appliances 
used in daily life are based on the principles of this science. Although 
charged particles in motion exert both electric and magnetic forces, in 
the frame of reference where all the charges are at rest, the forces 
are purely electrical. You know that gravitational force is a long-range 
force. Its effect is felt even when the distance between the interacting 
particles is very large because the force decreases inversely as the 
square of the distance between the interacting bodies. We will learn 
in this chapter that electric force is also as pervasive and is in fact 
stronger than the gravitational force by several orders of magnitude 
(refer to Chapter 1 of Class XI Physics Textbook). 

A simple apparatus to detect charge on a body is the gold-leaf 
electroscope [Fig. 1.2(a)]. It consists of a vertical metal rod housed in a 
box, with two thin gold leaves attached to its bottom end. When a
charged object touches the metal knob at the top of the rod, charge flows
on to the leaves and they diverge. The degree of divergence is an
indicator of the amount of charge.

Students can make a simple electroscope as follows [Fig. 1.2(b)]: Take a 
thin aluminium curtain rod with ball ends fitted for hanging the curtain. 
Cut out a piece of length about 20 cm with the ball at one end and flatten 
the cut end. Take a large bottle that can hold this rod and a cork which 
will fit in the opening of the bottle. Make a hole in the cork sufficient to 
hold the curtain rod snugly. Slide the rod through the hole in the cork with 
the cut end on the lower side and ball end projecting above the cork. Fold 
a small, thin aluminium foil (about 6 cm in length) in the middle and 
attach it to the flattened end of the rod by cellulose tape. This forms the 
leaves of your electroscope. Fit the cork in the bottle with about 5 cm of 
the ball end projecting above the cork. A paper scale may be put inside 
the bottle in advance to measure the separation of leaves. The 
separation is a rough measure of the amount of charge on the 
electroscope. 



Figure 1.2 Electroscopes: (a) The gold leaf electroscope, (b) Schematics of a simple electroscope.


To understand how the electroscope works, use the white paper strips we 
used for seeing the attraction of charged bodies. Fold the strips into half 
so that you make a mark of fold. Open the strip and iron it lightly with the 
mountain fold up, as shown in Fig. 1.3. Hold the strip by pinching it at the 
fold. You would notice that the two halves move apart. This shows that 
the strip has acquired charge on ironing. When you fold it into half, both 
the halves have the same charge. Hence they repel each other. The
same effect is seen in the leaf electroscope. On charging the curtain rod
by touching the ball end with an electrified body, charge is transferred to 
the curtain rod and the attached aluminium foil. Both the halves of the foil 
get similar charge and therefore repel each other. The divergence in the 
leaves depends on the amount of charge on them. Let us first try to 
understand why material bodies acquire charge. 

You know that all matter is made up of atoms and/or molecules. Although 
normally the materials are electrically neutral, they do contain charges; 
but their charges are exactly balanced. Forces that hold the molecules 
together, forces that hold atoms together in a solid, the adhesive force of 
glue, forces associated with surface tension, all are basically electrical in 
nature, arising from the forces between charged particles. Thus the 
electric force is all pervasive and it encompasses almost each and every 
field associated with our life. It is therefore essential that we learn more 
about such a force. 


Figure 1.3 Paper strip experiment.


To electrify a neutral body, we need to add or remove one kind of charge.
When we say that a body is charged, we always refer to this excess
charge or deficit of charge. In solids, some of the electrons, being less 
tightly bound in the atom, are the charges which are transferred from one 
body to the other. A body can thus be charged positively by losing some 
of its electrons. Similarly, a body can be charged negatively by gaining 
electrons. When we rub a glass rod with silk, some of the electrons from 
the rod are transferred to the silk cloth. Thus the rod gets positively 
charged and the silk gets negatively charged. No new charge is created 
in the process of rubbing. Also, the number of electrons that are
transferred is a very small fraction of the total number of electrons in the
material body. Moreover, only the less tightly bound electrons in a material
body can be transferred from it to another by rubbing. Therefore, when a 
body is rubbed with another, the bodies get charged and that is why we 
have to stick to certain pairs of materials to notice charging on rubbing 
the bodies. 

1.3 Conductors and Insulators

A metal rod held in hand and rubbed with wool will not show any sign of 
being charged. However, if a metal rod with a wooden or plastic handle is 
rubbed without touching its metal part, it shows signs of charging. 
Suppose we connect one end of a copper wire to a neutral pith ball and 
the other end to a negatively charged plastic rod. We will find that the pith 
ball acquires a negative charge. If a similar experiment is repeated with a 
nylon thread or a rubber band, no transfer of charge will take place from 
the plastic rod to the pith ball. Why does the transfer of charge not take 
place from the rod to the ball? 

Some substances readily allow passage of electricity through them, 
others do not. Those which allow electricity to pass through them easily 
are called conductors. They have electric charges (electrons) that are
comparatively free to move inside the material. Metals, human and
animal bodies and earth are conductors. Most of the non-metals like 
glass, porcelain, plastic, nylon, wood offer high resistance to the passage 
of electricity through them. They are called insulators. Most substances 
fall into one of the two classes stated above*. 

When some charge is transferred to a conductor, it readily gets 
distributed over the entire surface of the conductor. In contrast, if some 
charge is put on an insulator, it stays at the same place. You will learn 
why this happens in the next chapter. 

This property of the materials tells you why a nylon or plastic comb gets 
electrified on combing dry hair or on rubbing, but a metal article like 
spoon does not. The charges on metal leak through our body to the 
ground as both are conductors of electricity. 

When we bring a charged body in contact with the earth, all the excess 
charge on the body disappears by causing a momentary current to pass 
to the ground through the connecting conductor (such as our body). This 
process of sharing the charges with the earth is called grounding or 
earthing. Earthing provides a safety measure for electrical circuits and 
appliances. A thick metal plate is buried deep into the earth and thick 
wires are drawn from this plate; these are used in buildings for the 
purpose of earthing near the mains supply. The electric wiring in our 
houses has three wires: live, neutral and earth. The first two carry electric 
current from the power station and the third is earthed by connecting it to 
the buried metal plate. Metallic bodies of the electric appliances such as 
electric iron, refrigerator, TV are connected to the earth wire. When any 
fault occurs or live wire touches the metallic body, the charge flows to the 
earth without damaging the appliance and without causing any injury to 
the humans; this would have otherwise been unavoidable since the 
human body is a conductor of electricity. 



* There is a third category called semiconductors, which offer resistance 
to the movement of charges which is intermediate between the 
conductors and insulators. 

1.4 Charging by Induction 

When we touch a pith ball with an electrified plastic rod, some of the 
negative charges on the rod are transferred to the pith ball and it also 
gets charged. Thus the pith ball is charged by contact. It is then repelled 
by the plastic rod but is attracted by a glass rod which is oppositely 
charged. However, why an electrified rod attracts light objects is a
question we have still left unanswered. Let us try to understand what
could be happening by performing the following experiment. 


Figure 1.4 Charging by induction.

(i) Bring two metal spheres, A and B, supported on insulating stands, in
contact as shown in Fig. 1.4(a).

(ii) Bring a positively charged rod near one of the spheres, say A, taking 
care that it does not touch the sphere. The free electrons in the spheres 
are attracted towards the rod. This leaves an excess of positive charge 
on the rear surface of sphere B. Both kinds of charges are bound in the 
metal spheres and cannot escape. They, therefore, reside on the 
surfaces, as shown in Fig. 1.4(b). The left surface of sphere A has an
excess of negative charge and the right surface of sphere B has an
excess of positive charge. However, not all of the electrons in the
spheres have accumulated on the left surface of A. As the negative 
charge starts building up at the left surface of A, other electrons are 
repelled by these. In a short time, equilibrium is reached under the action 
of force of attraction of the rod and the force of repulsion due to the 
accumulated charges. Fig. 1.4(b) shows the equilibrium situation. The 
process is called induction of charge and happens almost instantly. The 
accumulated charges remain on the surface, as shown, till the glass rod 
is held near the sphere. If the rod is removed, the charges are not acted
on by any outside force and they redistribute to their original neutral state.

(iii) Separate the spheres by a small distance while the glass rod is still 
held near sphere A, as shown in Fig. 1.4(c). The two spheres are found 
to be oppositely charged and attract each other. 

(iv) Remove the rod. The charges on spheres rearrange themselves as 
shown in Fig. 1.4(d). Now, separate the spheres quite apart. The charges 
on them get uniformly distributed over them, as shown in Fig. 1.4(e).

In this process, the metal spheres will each be equal and oppositely 
charged. This is charging by induction. The positively charged glass rod 
does not lose any of its charge, contrary to the process of charging by 
contact. 

When electrified rods are brought near light objects, a similar effect takes 
place. The rods induce opposite charges on the near surfaces of the 
objects and similar charges move to the farther side of the object. [This 
happens even when the light object is not a conductor. The mechanism 
for how this happens is explained later in Sections 1.10 and 2.10.] The 
centres of the two types of charges are slightly separated. We know that 
opposite charges attract while similar charges repel. However, the 



magnitude of force depends on the distance between the charges and in 
this case the force of attraction overweighs the force of repulsion. As a 
result the particles like bits of paper or pith balls, being light, are pulled 
towards the rods. 
Interactive animation on charging a two-sphere system by induction:
http://www.physicsclassroom.com/mmedia/estatics/itsn.cfm 


Example 1.1 How can you charge a metal sphere positively without 
touching it? 

Solution Figure 1.5(a) shows an uncharged metallic sphere on an
insulating stand. Bring a negatively charged rod close to the
metallic sphere, as shown in Fig. 1.5(b). As the rod is brought close 
to the sphere, the free electrons in the sphere move away due to 
repulsion and start piling up at the farther end. The near end 
becomes positively charged due to deficit of electrons. This process 
of charge distribution stops when the net force on the free electrons 
inside the metal is zero. Connect the sphere to the ground by a 
conducting wire. The electrons will flow to the ground while the 
positive charges at the near end will remain held there due to the 
attractive force of the negative charges on the rod, as shown in Fig. 
1.5(c). Disconnect the sphere from the ground. The positive charge
continues to be held at the near end [Fig. 1.5(d)]. Remove the electrified
rod. The
positive charge will spread uniformly over the sphere as shown in Fig. 
1.5(e). 





Figure 1.5 


In this experiment, the metal sphere gets charged by the process of 
induction and the rod does not lose any of its charge. 

Similar steps are involved in charging a metal sphere negatively by 
induction, by bringing a positively charged rod near it. In this case the 
electrons will flow from the ground to the sphere when the sphere is 
connected to the ground with a wire. Can you explain why? 


1.5 Basic Properties of Electric Charge 

We have seen that there are two types of charges, namely positive and 
negative and their effects tend to cancel each other. Here, we shall now
describe some other properties of the electric charge.

If the sizes of charged bodies are very small as compared to the 
distances between them, we treat them as point charges. All the 
charge content of the body is assumed to be concentrated at one point 
in space. 

1.5.1 Additivity of charges 

We have not as yet given a quantitative definition of a charge; we shall 
follow it up in the next section. We shall tentatively assume that this can 
be done and proceed. If a system contains two point charges $q_1$ and $q_2$,
the total charge of the system is obtained simply by adding algebraically
$q_1$ and $q_2$, i.e., charges add up like real numbers or they are scalars like
the mass of a body. If a system contains n charges $q_1, q_2, q_3, \ldots, q_n$, then
the total charge of the system is $q_1 + q_2 + q_3 + \ldots + q_n$. Charge has
magnitude but no direction, similar to mass. However, there is one
difference between mass and charge. Mass of a body is always positive 
whereas a charge can be either positive or negative. Proper signs have 
to be used while adding the charges in a system. For example, the 
total charge of a system containing five charges +1, +2, -3, +4 and -5, 
in some arbitrary unit, is (+1) + (+2) + (-3) + (+4) + (-5) = -1 in the 
same unit. 
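
The algebraic addition described above can be sketched in a few lines of Python. This is only an illustration: the function name total_charge is an assumption introduced here, and the charges are taken to be in the same arbitrary unit as in the text.

```python
def total_charge(charges):
    """Return the algebraic (signed) sum of a list of charges in a common unit."""
    return sum(charges)

# The five charges from the example in the text, in some arbitrary unit.
system = [+1, +2, -3, +4, -5]
print(total_charge(system))  # -1
```

Because the signs are carried with the numbers, positive and negative charges cancel automatically in the sum.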

1.5.2 Charge is conserved 

We have already hinted at the fact that when bodies are charged by
rubbing, there is transfer of electrons from one body to the other; no new 
charges are either created or destroyed. A picture of particles of electric 
charge enables us to understand the idea of conservation of charge. 
When we rub two bodies, what one body gains in charge the other body 



loses. Within an isolated system consisting of many charged bodies, due 
to interactions among the bodies, charges may get redistributed but it is 
found that the total charge of the isolated system is always conserved. 
Conservation of charge has been established experimentally. 

It is not possible to create or destroy net charge carried by any isolated 
system although the charge carrying particles may be created or 
destroyed in a process. Sometimes nature creates charged particles: a 
neutron turns into a proton and an electron. The proton and electron thus 
created have equal and opposite charges and the total charge is zero 
before and after the creation. 

1.5.3 Quantisation of charge 

Experimentally it is established that all free charges are integral multiples 
of a basic unit of charge denoted by e. Thus charge q on a body is 
always given by 

q = ne 

where n is any integer, positive or negative. This basic unit of charge is 
the charge that an electron or proton carries. By convention, the charge 
on an electron is taken to be negative; therefore charge on an electron is 
written as -e and that on a proton as +e. 

The fact that electric charge is always an integral multiple of e is termed 
as quantisation of charge. There are a large number of situations in 
physics where certain physical quantities are quantised. The quantisation 
of charge was first suggested by the experimental laws of electrolysis 
discovered by English experimentalist Faraday. It was experimentally 
demonstrated by Millikan in 1912. 

In the International System (SI) of Units, a unit of charge is called a 
coulomb and is denoted by the symbol C. A coulomb is defined in terms of
the unit of the electric current which you are going to learn in a
subsequent chapter. In terms of this definition, one coulomb is the charge
flowing through a wire in 1 s if the current is 1 A (ampere) (see Chapter 2
of Class XI, Physics Textbook, Part I). In this system, the value of the
basic unit of charge is

$e = 1.602192 \times 10^{-19}$ C

Thus, there are about $6 \times 10^{18}$ electrons in a charge of -1 C. In
electrostatics, charges of this large magnitude are seldom encountered
and hence we use smaller units 1 μC (micro coulomb) = $10^{-6}$ C or 1 mC
(milli coulomb) = $10^{-3}$ C.

If the protons and electrons are the only basic charges in the universe, all
the observable charges have to be integral multiples of e. Thus, if a body
contains $n_1$ electrons and $n_2$ protons, the total amount of charge on the
body is $n_2 \times e + n_1 \times (-e) = (n_2 - n_1)e$. Since $n_1$ and $n_2$ are integers, their
difference is also an integer. Thus the charge on any body is always an
integral multiple of e and can be increased or decreased also in steps of
e.
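
As a rough numerical illustration of quantisation, the following Python sketch computes the net charge of a body from its electron and proton counts, and estimates how many electrons make up a given charge. The names E_CHARGE, net_charge and electrons_for_charge are assumptions made for this sketch, and the value of e is rounded.

```python
E_CHARGE = 1.602e-19  # magnitude of the basic unit of charge e, in coulomb (rounded)

def net_charge(n_electrons, n_protons):
    """Net charge in coulomb: (n_protons - n_electrons) * e."""
    return (n_protons - n_electrons) * E_CHARGE

def electrons_for_charge(q_coulomb):
    """Approximate number of electrons whose total charge has magnitude |q|."""
    return abs(q_coulomb) / E_CHARGE

print(net_charge(10, 10))          # a neutral body: equal numbers cancel
print(electrons_for_charge(-1.0))  # roughly 6 x 10^18 electrons in -1 C
```

The first function can only return integer multiples of e, which is the statement of quantisation; the second shows why the step size is invisible for charges of everyday size.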

The step size e is, however, very small because at the macroscopic level, 
we deal with charges of a few μC. At this scale the fact that charge of a
body can increase or decrease in units of e is not visible. In this respect, 
the grainy nature of the charge is lost and it appears to be continuous. 

This situation can be compared with the geometrical concepts of points 
and lines. A dotted line viewed from a distance appears continuous to 
us but is not continuous in reality. As many points very close to 
each other normally give an impression of a continuous line, many 
small charges taken together appear as a continuous charge
distribution.


At the macroscopic level, one deals with charges that are enormous
compared to the magnitude of charge e. Since $e = 1.6 \times 10^{-19}$ C, a
charge of magnitude, say 1 μC, contains something like $10^{13}$ times the
electronic charge. At this scale, the fact that charge can increase or 
decrease only in units of e is not very different from saying that charge 
can take continuous values. Thus, at the macroscopic level, the 
quantisation of charge has no practical consequence and can be ignored. 
However, at the microscopic level, where the charges involved are of the 
order of a few tens or hundreds of e, i.e., they can be counted, they 
appear in discrete lumps and quantisation of charge cannot be ignored. It 
is the magnitude of scale involved that is very important. 


Example 1.2 If $10^9$ electrons move out of a body to another body
every second, how much time is required to get a total charge of 1 C
on the other body?

Solution In one second $10^9$ electrons move out of the body.
Therefore the charge given out in one second is $1.6 \times 10^{-19} \times 10^9$ C =
$1.6 \times 10^{-10}$ C.

The time required to accumulate a charge of 1 C can then be
estimated to be $1\ \text{C} \div (1.6 \times 10^{-10}\ \text{C/s}) = 6.25 \times 10^9\ \text{s} = 6.25 \times 10^9 \div (365 \times 24 \times 3600)$
years = 198 years. Thus to collect a charge of one
coulomb, from a body from which $10^9$ electrons move out every
second, we will need approximately 200 years. One coulomb is,
therefore, a very large unit for many practical purposes.
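
The arithmetic of Example 1.2 can be checked with a short Python sketch. The function name years_to_accumulate and the constant names are assumptions introduced for illustration; the value of e is the rounded one used in the example.

```python
E_CHARGE = 1.6e-19                  # C, rounded value used in the example
SECONDS_PER_YEAR = 365 * 24 * 3600  # seconds in a 365-day year

def years_to_accumulate(q_target, electrons_per_second):
    """Years needed to transfer q_target coulomb at a given electron rate."""
    rate = electrons_per_second * E_CHARGE  # charge transferred per second, in C/s
    seconds = q_target / rate
    return seconds / SECONDS_PER_YEAR

print(round(years_to_accumulate(1.0, 1e9)))  # 198
```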



It is, however, also important to know what is roughly the number of
electrons contained in a piece of one cubic centimetre of a material. A
cubic piece of copper of side 1 cm contains about $2.5 \times 10^{24}$
electrons.


Example 1.3 How much positive and negative charge is there in a 
cup of water? 

Solution Let us assume that the mass of one cup of water is
250 g. The molecular mass of water is 18 g. Thus, one mole
(= $6.02 \times 10^{23}$ molecules) of water is 18 g. Therefore the number of
molecules in one cup of water is $(250/18) \times 6.02 \times 10^{23}$.

Each molecule of water contains two hydrogen atoms and one
oxygen atom, i.e., 10 electrons and 10 protons. Hence the total
positive and total negative charge has the same magnitude. It is
equal to

$(250/18) \times 6.02 \times 10^{23} \times 10 \times 1.6 \times 10^{-19}$ C = $1.34 \times 10^{7}$ C.
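
The estimate in Example 1.3 can be reproduced with the Python sketch below. The function name charge_in_water and its default arguments are assumptions made here; the numerical inputs are the rounded values used in the example.

```python
AVOGADRO = 6.02e23  # molecules per mole (rounded, as in the example)
E_CHARGE = 1.6e-19  # C (rounded, as in the example)

def charge_in_water(mass_g, molar_mass_g=18.0, electrons_per_molecule=10):
    """Magnitude in coulomb of the total negative (or positive) charge in water."""
    molecules = mass_g / molar_mass_g * AVOGADRO
    return molecules * electrons_per_molecule * E_CHARGE

print(charge_in_water(250.0))  # about 1.34e7 C
```

The same number applies to the positive charge, since each water molecule carries 10 protons as well as 10 electrons.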


1.6 Coulomb’s Law 

Coulomb’s law is a quantitative statement about the force between two 
point charges. When the linear sizes of charged bodies are much smaller
than the distance separating them, the sizes may be ignored and the
charged bodies are treated as point charges. Coulomb measured the 
force between two point charges and found that it varied inversely as the 
square of the distance between the charges and was directly proportional 
to the product of the magnitude of the two charges and acted along the
line joining the two charges. Thus, if two point charges $q_1$, $q_2$ are
separated by a distance r in vacuum, the magnitude of the force (F)
between them is given by

$F = k \frac{|q_1 q_2|}{r^2}$    (1.1)

How did Coulomb arrive at this law from his experiments? Coulomb used 
a torsion balance* for measuring the force between two charged metallic 
spheres. When the separation between two spheres is much larger than 
the radius of each sphere, the charged spheres may be regarded as 
point charges. However, the charges on the spheres were unknown, to 
begin with. How then could he discover a relation like Eq. (1.1)? Coulomb 
thought of the following simple way: Suppose the charge on a metallic 
sphere is q. If the sphere is put in contact with an identical uncharged 
sphere, the charge will spread over the two spheres. By symmetry, the 
charge on each sphere will be q/2*. Repeating this process, we can get 
charges q/2, q/4, etc. Coulomb varied the distance for a fixed pair of 
charges and measured the force for different separations. He then varied 
the charges in pairs, keeping the distance fixed for each pair. Comparing 
forces for different pairs of charges at different distances, Coulomb 
arrived at the relation, Eq. (1.1). 
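
The charge-halving idea can be sketched in Python as follows. This is a simplified model under the assumption stated in the text (two identical spheres share charge equally by symmetry); the function names are introduced here for illustration.

```python
def share_with_identical_neutral(q):
    """Charges on two identical spheres after a sphere with charge q touches a neutral one."""
    return q / 2, q / 2

def halving_sequence(q, contacts):
    """Charge left on the original sphere after each successive contact with a fresh neutral sphere."""
    values = []
    for _ in range(contacts):
        q, _other = share_with_identical_neutral(q)
        values.append(q)
    return values

print(halving_sequence(1.0, 3))  # [0.5, 0.25, 0.125]
```

The initial charge is set to 1.0 in an arbitrary unit, which reflects the point made above: only the ratios q/2, q/4, ... are needed, not the unknown absolute value of q.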

* A torsion balance is a sensitive device to measure force. It was also 
used later by Cavendish to measure the very feeble gravitational force 
between two objects, to verify Newton’s Law of Gravitation. 

Coulomb’s law, a simple mathematical statement, was initially 
experimentally arrived at in the manner described above. While the 
original experiments established it at a macroscopic scale, it has also
been established down to subatomic level ($r \sim 10^{-10}$ m).




Charles Augustin de Coulomb (1736 - 1806) Coulomb, a French 
physicist, began his career as a military engineer in the West Indies. 

In 1776, he returned to Paris and retired to a small estate to do his 
scientific research. He invented a torsion balance to measure the 
quantity of a force and used it for determination of forces of electric 
attraction or repulsion between small charged spheres. He thus 
arrived in 1785 at the inverse square law relation, now known as 
Coulomb’s law. The law had been anticipated by Priestley and also 
by Cavendish earlier, though Cavendish never published his results. 
Coulomb also found the inverse square law of force between unlike 
and like magnetic poles. 

Coulomb discovered his law without knowing the explicit magnitude of 
the charge. In fact, it is the other way round: Coulomb’s law can now be 
employed to furnish a definition for a unit of charge. In the relation, Eq. 
(1.1), k is so far arbitrary. We can choose any positive value of k. The 
choice of k determines the size of the unit of charge. In SI units, the value
of k is about $9 \times 10^9$ N m$^2$ C$^{-2}$. The unit of charge that results from this choice
is called a coulomb which we defined earlier in Section 1.5.3. Putting this
value of k in Eq. (1.1), we see that for
$q_1 = q_2 = 1$ C, $r = 1$ m

$F = 9 \times 10^9$ N






That is, 1 C is the charge that when placed at a distance of 1 m from
another charge of the same magnitude in vacuum experiences an
electrical force of repulsion of magnitude $9 \times 10^9$ N. One coulomb is
evidently too big a unit to be used. In practice, in electrostatics, one uses
smaller units like 1 mC or 1 μC.
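
To get a feel for these magnitudes, the scalar form of Coulomb's law, Eq. (1.1), can be evaluated numerically. The Python sketch below is illustrative only: the function name coulomb_force_magnitude is an assumption, and k is taken as the approximate value 9 x 10^9 in SI units.

```python
K = 9e9  # approximate value of k in SI units (N m^2 C^-2)

def coulomb_force_magnitude(q1, q2, r):
    """Magnitude of the force in newton between point charges q1, q2 (coulomb) a distance r (metre) apart in vacuum."""
    if r <= 0:
        raise ValueError("separation r must be positive")
    return K * abs(q1 * q2) / r**2

print(coulomb_force_magnitude(1.0, 1.0, 1.0))    # 9000000000.0
print(coulomb_force_magnitude(1e-6, 1e-6, 1.0))  # two 1 microcoulomb charges at 1 m
```

Two charges of 1 μC each at 1 m give a force of about 0.009 N, which is why micro coulomb is the more convenient unit in electrostatics.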

The constant k in Eq. (1.1) is usually put as
$k = 1/4\pi\varepsilon_0$ for later convenience, so that Coulomb’s law is written as

$F = \frac{1}{4\pi\varepsilon_0} \frac{|q_1 q_2|}{r^2}$    (1.2)

$\varepsilon_0$ is called the permittivity of free space. The value of $\varepsilon_0$ in SI units is

$\varepsilon_0 = 8.854 \times 10^{-12}$ C$^2$ N$^{-1}$ m$^{-2}$





Figure 1.6 (a) Geometry and (b) Forces between charges. 

Since force is a vector, it is better to write Coulomb’s law in vector
notation. Let the position vectors of charges $q_1$ and $q_2$ be $\mathbf{r}_1$ and $\mathbf{r}_2$
respectively [see Fig. 1.6(a)]. We denote the force on $q_1$ due to $q_2$ by $\mathbf{F}_{12}$ and the
force on $q_2$ due to $q_1$ by $\mathbf{F}_{21}$. The two point charges $q_1$ and $q_2$ have been
numbered 1 and 2 for convenience, and the vector leading from 1 to 2 is
denoted by $\mathbf{r}_{21}$:

$$\mathbf{r}_{21} = \mathbf{r}_2 - \mathbf{r}_1$$

In the same way, the vector leading from 2 to 1 is denoted by $\mathbf{r}_{12}$:

$$\mathbf{r}_{12} = \mathbf{r}_1 - \mathbf{r}_2 = -\mathbf{r}_{21}$$

The magnitudes of the vectors $\mathbf{r}_{21}$ and $\mathbf{r}_{12}$ are denoted by $r_{21}$ and $r_{12}$,
respectively ($r_{12} = r_{21}$). The direction of a vector is specified by a unit
vector along the vector. To denote the direction from 1 to 2 (or from 2 to
1), we define the unit vectors:

* Implicit in this is the assumption of additivity of charges and 
conservation: two charges (q/2 each) add up to make a total charge q. 

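$$\hat{\mathbf{r}}_{21} = \frac{\mathbf{r}_{21}}{r_{21}}, \qquad \hat{\mathbf{r}}_{12} = \frac{\mathbf{r}_{12}}{r_{12}}, \qquad \hat{\mathbf{r}}_{21} = -\hat{\mathbf{r}}_{12}$$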

Coulomb’s force law between two point charges $q_1$ and $q_2$ located at $\mathbf{r}_1$
and $\mathbf{r}_2$ is then expressed as

$$\mathbf{F}_{21} = \frac{1}{4\pi\varepsilon_0}\,\frac{q_1 q_2}{r_{21}^2}\,\hat{\mathbf{r}}_{21} \qquad (1.3)$$

Some remarks on Eq. (1.3) are relevant:

• Equation (1.3) is valid for any sign of $q_1$ and $q_2$, whether positive or
negative. If $q_1$ and $q_2$ are of the same sign (either both positive or both
negative), $\mathbf{F}_{21}$ is along $\hat{\mathbf{r}}_{21}$, which denotes repulsion, as it should be for
like charges. If $q_1$ and $q_2$ are of opposite signs, $\mathbf{F}_{21}$ is along $-\hat{\mathbf{r}}_{21}$ ($= \hat{\mathbf{r}}_{12}$),
which denotes attraction, as expected for unlike charges. Thus, we
do not have to write separate equations for the cases of like and unlike
charges. Equation (1.3) takes care of both cases correctly [Fig. 1.6(b)].

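• The force $\mathbf{F}_{12}$ on charge $q_1$ due to charge $q_2$ is obtained from Eq. (1.3)
by simply interchanging 1 and 2, i.e.,

$$\mathbf{F}_{12} = \frac{1}{4\pi\varepsilon_0}\,\frac{q_1 q_2}{r_{12}^2}\,\hat{\mathbf{r}}_{12} = -\mathbf{F}_{21}$$

Thus, Coulomb’s law agrees with Newton’s third law.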






• Coulomb’s law [Eq. (1.3)] gives the force between two charges $q_1$ and
$q_2$ in vacuum. If the charges are placed in matter or the intervening space
has matter, the situation gets complicated due to the presence of charged
constituents of matter. We shall consider electrostatics in matter in the
next chapter.
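The vector form of Coulomb’s law in Eq. (1.3) can be sketched numerically as follows. This is only an illustrative sketch: the function name `force_on_2_due_to_1`, the charge values and the positions are assumptions made for the example, and the constant uses the approximate SI value of $1/4\pi\varepsilon_0$.

```python
import math

K = 8.987e9  # 1/(4*pi*eps0) in N m^2 C^-2 (approximate SI value)

def force_on_2_due_to_1(q1, r1, q2, r2):
    """Return F21, the force on q2 (at r2) due to q1 (at r1), as in Eq. (1.3)."""
    r21 = [b - a for a, b in zip(r1, r2)]        # vector leading from 1 to 2
    dist = math.sqrt(sum(c * c for c in r21))    # magnitude r21
    scale = K * q1 * q2 / dist**3                # (k q1 q2 / r^2) times 1/r for the unit vector
    return [scale * c for c in r21]

# Hypothetical charges (C) and positions (m), chosen only for illustration.
q1, r1 = 1e-6, (0.0, 0.0, 0.0)
q2, r2 = 1e-6, (0.03, 0.04, 0.0)

F21 = force_on_2_due_to_1(q1, r1, q2, r2)   # force on charge 2 due to charge 1
F12 = force_on_2_due_to_1(q2, r2, q1, r1)   # interchange 1 and 2: force on charge 1
print("F21 =", F21)
print("F12 =", F12)
```

For like charges the force on charge 2 points away from charge 1, and interchanging the two charges reverses the force, in line with the remarks above.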
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Interactive animation on Coulomb’s law: 

http://webphysics.davidson.edu/physlet_resources/bu_semester2/menu_semester2.html


Example 1.4 Coulomb’s law for electrostatic force between two point 
charges and Newton’s law for gravitational force between two 
stationary point masses, both have inverse-square dependence on 
the distance between the charges and masses respectively. 

(a) Compare the strength of these forces by determining the ratio of
their magnitudes (i) for an electron and a proton and (ii) for two
protons. (b) Estimate the accelerations of electron and proton due to
the electrical force of their mutual attraction when they are
1 Å (= $10^{-10}$ m) apart. ($m_p = 1.67 \times 10^{-27}$ kg, $m_e = 9.11 \times 10^{-31}$ kg)

Solution 







(a) (i) The electric force between an electron and a proton at a
distance r apart is:

$$F_e = -\frac{1}{4\pi\varepsilon_0}\,\frac{e^2}{r^2}$$

where the negative sign indicates that the force is attractive. The
corresponding gravitational force (always attractive) is:

$$F_G = -G\,\frac{m_p m_e}{r^2}$$

where $m_p$ and $m_e$ are the masses of a proton and an electron
respectively.

$$\left|\frac{F_e}{F_G}\right| = \frac{e^2}{4\pi\varepsilon_0 G m_p m_e} = 2.4 \times 10^{39}$$


(ii) On similar lines, the ratio of the magnitudes of electric force to the
gravitational force between two protons at a distance r apart is:

$$\left|\frac{F_e}{F_G}\right| = \frac{e^2}{4\pi\varepsilon_0 G m_p m_p} = 1.3 \times 10^{36}$$


However, it may be mentioned here that the signs of the two forces
are different. For two protons, the gravitational force is attractive in
nature and the Coulomb force is repulsive. The actual values of these
forces between two protons inside a nucleus (distance between two
protons is ~ $10^{-15}$ m inside a nucleus) are $F_e \sim 230$ N, whereas $F_G \sim 1.9 \times 10^{-34}$ N.


The (dimensionless) ratio of the two forces shows that electrical 
forces are enormously stronger than the gravitational forces. 

(b) The electric force F exerted by a proton on an electron is the same in
magnitude as the force exerted by an electron on a proton; however,
the masses of an electron and a proton are different. Thus, the
magnitude of force is

$$|\mathbf{F}| = \frac{1}{4\pi\varepsilon_0}\,\frac{e^2}{r^2} = 8.987 \times 10^{9}\ \text{N m}^2/\text{C}^2 \times \frac{(1.6 \times 10^{-19}\ \text{C})^2}{(10^{-10}\ \text{m})^2} = 2.3 \times 10^{-8}\ \text{N}$$

Using Newton’s second law of motion, F = ma, the acceleration that
an electron will undergo is

$$a = \frac{2.3 \times 10^{-8}\ \text{N}}{9.11 \times 10^{-31}\ \text{kg}} = 2.5 \times 10^{22}\ \text{m/s}^2$$

Comparing this with the value of acceleration due to gravity, we can
conclude that the effect of gravitational field is negligible on the
motion of electron and it undergoes very large accelerations under
the action of Coulomb force due to a proton.

The value for acceleration of the proton is

$$\frac{2.3 \times 10^{-8}\ \text{N}}{1.67 \times 10^{-27}\ \text{kg}} = 1.4 \times 10^{19}\ \text{m/s}^2$$
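The arithmetic of Example 1.4 can be reproduced with a short script. It is a sketch under stated assumptions: the value of G is not quoted in this section and is taken here as $6.67 \times 10^{-11}$ N m$^2$ kg$^{-2}$, and all variable names are illustrative. With these rounded constants the ratios come out at the same orders of magnitude as the quoted results, though the leading digits may differ slightly.

```python
K = 8.987e9         # 1/(4*pi*eps0) in N m^2 C^-2, as used in the solution
G = 6.67e-11        # gravitational constant in N m^2 kg^-2 (assumed; not quoted in the text)
E_CHARGE = 1.6e-19  # magnitude of the electron/proton charge in C
M_P = 1.67e-27      # proton mass in kg
M_E = 9.11e-31      # electron mass in kg

def force_ratio(m1, m2):
    """|F_e / F_G| for two particles of charge magnitude e; the distance cancels."""
    return K * E_CHARGE**2 / (G * m1 * m2)

ratio_ep = force_ratio(M_P, M_E)   # electron-proton pair
ratio_pp = force_ratio(M_P, M_P)   # proton-proton pair

r = 1e-10                          # separation of 1 angstrom in m
force = K * E_CHARGE**2 / r**2     # magnitude of the Coulomb force
a_electron = force / M_E           # Newton's second law, a = F/m
a_proton = force / M_P

print(f"ratio (e-p): {ratio_ep:.2e}, ratio (p-p): {ratio_pp:.2e}")
print(f"force: {force:.2e} N, a_e: {a_electron:.2e} m/s^2, a_p: {a_proton:.2e} m/s^2")
```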


Example 1.5 A charged metallic sphere A is suspended by a nylon 
thread. Another charged metallic sphere B held by an insulating 
handle is brought close to A such that the distance between their 
centres is 10 cm, as shown in Fig. 1.7(a). The resulting repulsion of A 
is noted (for example, by shining a beam of light and measuring the 
deflection of its shadow on a screen). Spheres A and B are touched 
by uncharged spheres C and D respectively, as shown in Fig. 1.7(b). 
C and D are then removed and B is brought closer to A to a 
distance of 5.0 cm between their centres, as shown in Fig. 1.7(c). 







What is the expected repulsion of A on the basis of Coulomb’s law? 
Spheres A and C and spheres B and D have identical sizes. Ignore 
the sizes of A and B in comparison to the separation between their 
centres. 
























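Solution Let the original charge on sphere A be q and that on B be q'.
At a distance r between their centres, the magnitude of the
electrostatic force on each is given by

$$F = \frac{1}{4\pi\varepsilon_0}\,\frac{q q'}{r^2}$$

neglecting the sizes of spheres A and B in comparison to r. When an
identical but uncharged sphere C touches A, the charges redistribute
on A and C and, by symmetry, each sphere carries a charge q/2.
Similarly, after D touches B, the redistributed charge on each is
q'/2. Now, if the separation between A and B is halved, the magnitude
of the electrostatic force on each is

$$F' = \frac{1}{4\pi\varepsilon_0}\,\frac{(q/2)(q'/2)}{(r/2)^2} = \frac{1}{4\pi\varepsilon_0}\,\frac{q q'}{r^2} = F$$

Thus the electrostatic force on A, due to B, remains unaltered.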
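The cancellation in Example 1.5 can be checked numerically. The charges below are hypothetical (the example leaves q and q' unspecified); only the 10 cm and 5.0 cm separations come from the problem statement.

```python
def force_magnitude(q_a, q_b, r, k=8.987e9):
    """Magnitude of the Coulomb force between point charges q_a and q_b a distance r apart."""
    return k * abs(q_a * q_b) / r**2

q, q_prime = 2e-8, 3e-8   # hypothetical initial charges on A and B, in C
r = 0.10                  # initial separation of the centres, in m

before = force_magnitude(q, q_prime, r)
# Each charge is halved by contact with an identical uncharged sphere,
# and the separation is halved from 10 cm to 5.0 cm.
after = force_magnitude(q / 2, q_prime / 2, r / 2)

print(before, after)
```

Halving both charges reduces the product of charges by a factor of 4, and halving the distance raises $1/r^2$ by the same factor, so the two values agree.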


1.7 FORCES BETWEEN MULTIPLE CHARGES 

The mutual electric force between two charges is given by Coulomb’s 
law. How do we calculate the force on a charge when there are not one but
several charges around? Consider a system of n stationary charges $q_1$,
$q_2$, $q_3$, ..., $q_n$ in vacuum. What is the force on $q_1$ due to $q_2$, $q_3$, ..., $q_n$?

Coulomb’s law is not enough to answer this question. Recall that forces 
of mechanical origin add according to the parallelogram law of addition. 
Is the same true for forces of electrostatic origin? 

Experimentally, it is verified that force on any charge due to a number of 
other charges is the vector sum of all the forces on that charge due to the 
other charges, taken one at a time. The individual forces are unaffected 
due to the presence of other charges. This is termed as the principle of 
superposition. 

Figure 1.8 A system of (a) three charges (b) multiple charges. 









To better understand the concept, consider a system of three charges
$q_1$, $q_2$ and $q_3$, as shown in
Fig. 1.8(a). The force on one charge, say $q_1$, due to two other charges $q_2$,
$q_3$ can therefore be obtained by performing a vector addition of the forces
due to each one of these charges. Thus, if the force on $q_1$ due to $q_2$ is
denoted by $\mathbf{F}_{12}$, $\mathbf{F}_{12}$ is given by Eq. (1.3) even though other charges are
present.





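Thus,

$$\mathbf{F}_{12} = \frac{1}{4\pi\varepsilon_0}\,\frac{q_1 q_2}{r_{12}^2}\,\hat{\mathbf{r}}_{12}$$

In the same way, the force on $q_1$ due to $q_3$, denoted by $\mathbf{F}_{13}$, is given by

$$\mathbf{F}_{13} = \frac{1}{4\pi\varepsilon_0}\,\frac{q_1 q_3}{r_{13}^2}\,\hat{\mathbf{r}}_{13}$$

which again is the Coulomb force on $q_1$ due to $q_3$, even though the other
charge $q_2$ is present.

Thus the total force $\mathbf{F}_1$ on $q_1$ due to the two charges $q_2$ and $q_3$ is
given as

$$\mathbf{F}_1 = \mathbf{F}_{12} + \mathbf{F}_{13} = \frac{1}{4\pi\varepsilon_0}\,\frac{q_1 q_2}{r_{12}^2}\,\hat{\mathbf{r}}_{12} + \frac{1}{4\pi\varepsilon_0}\,\frac{q_1 q_3}{r_{13}^2}\,\hat{\mathbf{r}}_{13} \qquad (1.4)$$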

The above calculation of force can be generalised to a system of charges 
more than three, as shown in Fig. 1.8(b). 

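The principle of superposition says that in a system of charges $q_1$,
$q_2$, ..., $q_n$, the force on $q_1$ due to $q_2$ is the same as given by Coulomb’s
law, i.e., it is unaffected by the presence of the other charges $q_3$, $q_4$, ...,
$q_n$. The total force $\mathbf{F}_1$ on the charge $q_1$, due to all other charges, is then
given by the vector sum of the forces $\mathbf{F}_{12}$, $\mathbf{F}_{13}$, ..., $\mathbf{F}_{1n}$:

i.e.,

$$\mathbf{F}_1 = \mathbf{F}_{12} + \mathbf{F}_{13} + \dots + \mathbf{F}_{1n} = \frac{1}{4\pi\varepsilon_0}\left[\frac{q_1 q_2}{r_{12}^2}\,\hat{\mathbf{r}}_{12} + \frac{q_1 q_3}{r_{13}^2}\,\hat{\mathbf{r}}_{13} + \dots + \frac{q_1 q_n}{r_{1n}^2}\,\hat{\mathbf{r}}_{1n}\right] = \frac{q_1}{4\pi\varepsilon_0}\sum_{i=2}^{n}\frac{q_i}{r_{1i}^2}\,\hat{\mathbf{r}}_{1i} \qquad (1.5)$$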


The vector sum is obtained as usual by the parallelogram law of addition 
of vectors. All of electrostatics is basically a consequence of Coulomb’s 
law and the superposition principle. 
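The sum in Eq. (1.5) translates directly into a loop over the source charges. The sketch below is illustrative only: the function names `pair_force` and `net_force` and the sample charges are assumptions, not part of the text.

```python
import math

K = 8.987e9  # 1/(4*pi*eps0) in N m^2 C^-2 (approximate SI value)

def pair_force(q_target, r_target, q_source, r_source):
    """Coulomb force on q_target due to q_source alone, directed along r_target - r_source."""
    sep = [a - b for a, b in zip(r_target, r_source)]
    dist = math.sqrt(sum(c * c for c in sep))
    scale = K * q_target * q_source / dist**3
    return [scale * c for c in sep]

def net_force(q_target, r_target, sources):
    """Vector sum of the pairwise forces, each computed as if the other charges were absent."""
    total = [0.0] * len(r_target)
    for q_source, r_source in sources:
        f = pair_force(q_target, r_target, q_source, r_source)
        total = [t + c for t, c in zip(total, f)]
    return total

# Hypothetical configuration: q1 at the origin, q2 and q3 one metre away along x and y.
sources = [(1e-6, (1.0, 0.0)), (1e-6, (0.0, 1.0))]
F1 = net_force(1e-6, (0.0, 0.0), sources)
print("F1 =", F1)
```

Each pairwise term is evaluated independently of the others, which is exactly the content of the superposition principle; the component-wise addition plays the role of the parallelogram law.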

Example 1.6 Consider three charges $q_1$, $q_2$, $q_3$ each equal to q at the
vertices of an equilateral triangle of side l. What is the force on a
charge Q (with the same sign as q) placed at the centroid of the
triangle, as shown in Fig. 1.9?

Figure 1.9


Solution In the given equilateral triangle ABC of sides of length l, if
we draw a perpendicular AD to the side BC,
AD = AC cos 30° = ($\sqrt{3}/2$) l and the distance AO of the centroid O
from A is (2/3) AD = ($1/\sqrt{3}$) l. By symmetry AO = BO = CO.

Thus,

Force $\mathbf{F}_1$ on Q due to charge q at A = $\dfrac{3}{4\pi\varepsilon_0}\,\dfrac{Qq}{l^2}$ along AO

Force $\mathbf{F}_2$ on Q due to charge q at B = $\dfrac{3}{4\pi\varepsilon_0}\,\dfrac{Qq}{l^2}$ along BO

Force $\mathbf{F}_3$ on Q due to charge q at C = $\dfrac{3}{4\pi\varepsilon_0}\,\dfrac{Qq}{l^2}$ along CO

The resultant of forces $\mathbf{F}_2$ and $\mathbf{F}_3$ is $\dfrac{3}{4\pi\varepsilon_0}\,\dfrac{Qq}{l^2}$ along OA, by the
parallelogram law. Therefore, the total force on Q = $\dfrac{3}{4\pi\varepsilon_0}\,\dfrac{Qq}{l^2}\,(\hat{\mathbf{r}} - \hat{\mathbf{r}})$
= 0, where $\hat{\mathbf{r}}$ is the unit vector along OA.

It is clear also by symmetry that the three forces will sum to zero. 
Suppose that the resultant force was non-zero but in some direction. 
Consider what would happen if the system was rotated through 60° 
about O. 
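The result of Example 1.6 can be checked numerically by placing the centroid at the origin. The side length and charge values below are hypothetical, chosen only to make the sketch concrete.

```python
import math

K = 8.987e9  # 1/(4*pi*eps0) in N m^2 C^-2 (approximate SI value)

def force_on_target(q_t, r_t, q_s, r_s):
    """Coulomb force on q_t (at r_t) due to q_s (at r_s)."""
    sep = [a - b for a, b in zip(r_t, r_s)]
    dist = math.hypot(*sep)
    return [K * q_t * q_s * c / dist**3 for c in sep]

l, q, Q = 0.10, 1e-8, 2e-8   # hypothetical side (m) and charges (C)
R = l / math.sqrt(3)         # distance of each vertex from the centroid, AO = l / sqrt(3)

# Vertices A, B, C on a circle of radius R around the centroid O at the origin.
vertices = [(R * math.cos(math.radians(a)), R * math.sin(math.radians(a)))
            for a in (90, 210, 330)]

forces = [force_on_target(Q, (0.0, 0.0), q, v) for v in vertices]
net = [sum(f[i] for f in forces) for i in range(2)]
single = math.hypot(*forces[0])   # magnitude of one of the three forces

print("each force:", single, "N; net force:", net)
```

Each individual force has magnitude $3Qq/(4\pi\varepsilon_0 l^2)$, while the net force vanishes up to floating-point rounding.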


Example 1.7 Consider the charges q, q, and -q placed at the
vertices of an equilateral triangle, as shown in Fig. 1.10. What is the
force on each charge?

Figure 1.10


Solution The forces acting on charge q at A due to charges q at B
and -q at C are $\mathbf{F}_{12}$ along BA and $\mathbf{F}_{13}$ along AC respectively, as
shown in Fig. 1.10. By the parallelogram law, the total force $\mathbf{F}_1$ on the
charge q at A is given by

$\mathbf{F}_1 = F\,\hat{\mathbf{r}}_1$, where $\hat{\mathbf{r}}_1$ is a unit vector along BC.

The force of attraction or repulsion for each pair of charges has the
same magnitude

$$F = \frac{q^2}{4\pi\varepsilon_0 l^2}$$

The total force $\mathbf{F}_2$ on charge q at B is thus $\mathbf{F}_2 = F\,\hat{\mathbf{r}}_2$, where $\hat{\mathbf{r}}_2$ is a
unit vector along AC.

Similarly the total force on charge -q at C is $\mathbf{F}_3 = \sqrt{3}\,F\,\hat{\mathbf{n}}$, where $\hat{\mathbf{n}}$
is the unit vector along the direction bisecting ∠BCA.

It is interesting to see that the sum of the forces on the three charges
is zero, i.e.,

$$\mathbf{F}_1 + \mathbf{F}_2 + \mathbf{F}_3 = 0$$

The result is not at all surprising. It follows straight from the fact that 
Coulomb’s law is consistent with Newton’s third law. The proof is left 
to you as an exercise. 
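A numerical check of Example 1.7 is sketched below. The coordinates are an assumed layout (A at the top, B at the lower left, C at the lower right), and the side length and charge are hypothetical values.

```python
import math

K = 8.987e9  # 1/(4*pi*eps0) in N m^2 C^-2 (approximate SI value)
l, q = 0.10, 1e-8   # hypothetical side (m) and charge magnitude (C)

# Assumed layout: q at A, q at B, -q at C.
charges = {
    "A": (q, (0.0, math.sqrt(3) / 2 * l)),
    "B": (q, (-l / 2, 0.0)),
    "C": (-q, (l / 2, 0.0)),
}

def total_force(name):
    """Net force on the named charge due to the other two, by superposition."""
    q_t, r_t = charges[name]
    net = [0.0, 0.0]
    for other, (q_s, r_s) in charges.items():
        if other == name:
            continue
        sep = [a - b for a, b in zip(r_t, r_s)]
        dist = math.hypot(*sep)
        net = [n + K * q_t * q_s * c / dist**3 for n, c in zip(net, sep)]
    return net

F = K * q**2 / l**2   # common magnitude of each pairwise force
F1, F2, F3 = total_force("A"), total_force("B"), total_force("C")
total = [F1[i] + F2[i] + F3[i] for i in range(2)]
print(F1, F2, F3, total)
```

In this layout the direction BC is the +x axis, so $\mathbf{F}_1$ should come out as (F, 0), $\mathbf{F}_3$ should have magnitude $\sqrt{3}F$, and the three forces should sum to zero within rounding.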


1.8 ELECTRIC FIELD 

Let us consider a point charge Q placed in vacuum, at the origin O. If we 
place another point charge q at a point P, where OP = r, then the charge 
Q will exert a force on q as per Coulomb’s law. We may ask the question: 
If charge q is removed, then what is left in the surroundings? Is there
nothing? If there is nothing at the point P, then how does a force act when
we place the charge q at P? In order to answer such questions, the early
scientists introduced the concept of field. According to this, we say that
the charge Q produces an electric field everywhere in the surroundings.




When another charge q is brought at some point P, the field there acts on 
it and produces a force. The electric field produced by the charge Q at a 
point r is given as 


$$\mathbf{E}(\mathbf{r}) = \frac{1}{4\pi\varepsilon_0}\,\frac{Q}{r^2}\,\hat{\mathbf{r}} \qquad (1.6)$$

where $\hat{\mathbf{r}} = \mathbf{r}/r$ is a unit vector from the origin to the point r. Thus, Eq.
(1.6) specifies the value of the electric field for each value of the position
vector r. The word “field” signifies how some distributed quantity (which
could be a scalar or a vector) varies with position. The effect of the
charge has been incorporated in the existence of the electric field. We
obtain the force F exerted by a charge Q on a charge q, as

$$\mathbf{F} = \frac{1}{4\pi\varepsilon_0}\,\frac{Qq}{r^2}\,\hat{\mathbf{r}} \qquad (1.7)$$


Note that the charge q also exerts an equal and opposite force on the 
charge Q. The electrostatic force between the charges Q and q can be 
looked upon as an interaction between charge q and the electric field of 
Q and vice versa. If we denote the position of charge q by the vector r, it 
experiences a force F equal to the charge q multiplied by the electric 
field E at the location of q. Thus, 





Figure 1.11 Electric field (a) due to a charge Q, (b) due to a charge -Q. 

F(r) = q E(r) (1.8) 

Equation (1.8) defines the SI unit of electric field as N/C*. 

Some important remarks may be made here: 

(i) From Eq. (1.8), we can infer that if q is unity, the electric field due to a 
charge Q is numerically equal to the force exerted by it. Thus, the electric 
field due to a charge Q at a point in space may be defined as the force 
that a unit positive charge would experience if placed at that point. The 
charge Q, which is producing the electric field, is called a source charge 







and the charge q, which tests the effect of a source charge, is called a 
test charge. Note that the source charge Q must remain at its original 
location. However, if a charge q is brought at any point around Q, Q itself 
is bound to experience an electrical force due to q and will tend to move. 
A way out of this difficulty is to make q negligibly small. The force F is 
then negligibly small but the ratio F/q is finite and defines the electric 
field: 

$$\mathbf{E} = \lim_{q \to 0}\left(\frac{\mathbf{F}}{q}\right) \qquad (1.9)$$

A practical way to get around the problem (of keeping Q undisturbed in 
the presence of q) is to hold Q to its location by unspecified forces! This 
may look strange but actually this is what happens in practice. When we 
are considering the electric force on a test charge q due to a charged 
planar sheet (Section 1.15), the charges on the sheet are held to their 
locations by the forces due to the unspecified charged constituents inside 
the sheet. 

* An alternate unit V/m will be introduced in the next chapter. 

(ii) Note that the electric field E due to Q, though defined operationally in 
terms of some test charge q, is independent of q. This is because 

F is proportional to q, so the ratio F/q does not depend on q. The force F 
on the charge q due to the charge Q depends on the particular location of 
charge q which may take any value in the space around the charge Q. 
Thus, the electric field E due to Q is also dependent on the space 
coordinate r. For different positions of the charge q all over the space, we 
get different values of electric field E. The field exists at every point in 
three-dimensional space. 

(iii) For a positive charge, the electric field will be directed radially 
outwards from the charge. On the other hand, if the source charge is 



negative, the electric field vector, at each point, points radially inwards. 

(iv) Since the magnitude of the force F on charge q due to charge Q
depends only on the distance r of the charge q from charge Q,
the magnitude of the electric field E will also depend only on the distance
r. Thus at equal distances from the charge Q, the magnitude of its electric
field E is the same. The magnitude of electric field E due to a point charge is
thus the same on a sphere with the point charge at its centre; in other words,
it has a spherical symmetry.
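Remarks (i) to (iv) can be illustrated with a small sketch of Eq. (1.6) and Eq. (1.8). The function name `point_charge_field`, the source charge and the test charges are hypothetical choices for the illustration.

```python
import math

K = 8.987e9  # 1/(4*pi*eps0) in N m^2 C^-2 (approximate SI value)

def point_charge_field(Q, r_vec):
    """Electric field at position r_vec due to a point charge Q at the origin, Eq. (1.6)."""
    r = math.sqrt(sum(c * c for c in r_vec))
    return [K * Q * c / r**3 for c in r_vec]

def magnitude(v):
    return math.sqrt(sum(c * c for c in v))

Q = 5e-9                                            # hypothetical source charge in C
E_x = point_charge_field(Q, (0.3, 0.0, 0.0))        # two points at the same distance
E_z = point_charge_field(Q, (0.0, 0.0, 0.3))
E_neg = point_charge_field(-Q, (0.3, 0.0, 0.0))     # negative source charge

# Force on two different test charges at the same point, F = qE (Eq. 1.8).
ratios = []
for q_test in (1e-12, 1e-9):
    F = [q_test * c for c in E_x]
    ratios.append(F[0] / q_test)                    # F/q should not depend on q_test

print(magnitude(E_x), magnitude(E_z), E_neg[0], ratios)
```

The magnitudes at equal distances agree (spherical symmetry), the field of a negative charge points radially inwards, and the ratio F/q is the same for both test charges.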


1.8.1 Electric field due to a system of charges 

Consider a system of charges $q_1$, $q_2$, ..., $q_n$ with position vectors $\mathbf{r}_1$,
$\mathbf{r}_2$, ..., $\mathbf{r}_n$ relative to some origin O. Like the electric field at a point in
space due to a single charge, electric field at a point in space due to the
system of charges is defined to be the force experienced by a unit
test charge placed at that point, without disturbing the original
positions of charges $q_1$, $q_2$, ..., $q_n$. We can use Coulomb’s law and the
superposition principle to determine this field at a point P denoted by
position vector r.

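Electric field $\mathbf{E}_1$ at r due to $q_1$ at $\mathbf{r}_1$ is given by

$$\mathbf{E}_1 = \frac{1}{4\pi\varepsilon_0}\,\frac{q_1}{r_{1P}^2}\,\hat{\mathbf{r}}_{1P}$$

where $\hat{\mathbf{r}}_{1P}$ is a unit vector in the direction from $q_1$ to P, and $r_{1P}$ is the
distance between $q_1$ and P.

In the same manner, electric field $\mathbf{E}_2$ at r due to $q_2$ at $\mathbf{r}_2$ is

$$\mathbf{E}_2 = \frac{1}{4\pi\varepsilon_0}\,\frac{q_2}{r_{2P}^2}\,\hat{\mathbf{r}}_{2P}$$

where $\hat{\mathbf{r}}_{2P}$ is a unit vector in the direction from $q_2$ to P and $r_{2P}$ is the
distance between $q_2$ and P. Similar expressions hold good for fields $\mathbf{E}_3$,
$\mathbf{E}_4$, ..., $\mathbf{E}_n$ due to charges $q_3$, $q_4$, ..., $q_n$.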


By the superposition principle, the electric field E at r due to the system
of charges is (as shown in Fig. 1.12)

$$\mathbf{E}(\mathbf{r}) = \mathbf{E}_1(\mathbf{r}) + \mathbf{E}_2(\mathbf{r}) + \dots + \mathbf{E}_n(\mathbf{r}) = \frac{1}{4\pi\varepsilon_0}\sum_{i=1}^{n}\frac{q_i}{r_{iP}^2}\,\hat{\mathbf{r}}_{iP}$$

E is a vector quantity that varies from one point to another point in space
and is determined from the positions of the source charges.
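The superposition sum for the field can be sketched as a function of the field point and a list of source charges. The name `field_at` and the sample configuration (two equal positive charges on the x-axis) are assumptions for illustration.

```python
import math

K = 8.987e9  # 1/(4*pi*eps0) in N m^2 C^-2 (approximate SI value)

def field_at(point, charges):
    """Electric field at `point` due to a list of (q_i, r_i) pairs, summed by superposition."""
    total = [0.0] * len(point)
    for q_i, r_i in charges:
        sep = [p - c for p, c in zip(point, r_i)]   # vector from q_i to P
        dist = math.sqrt(sum(c * c for c in sep))   # distance r_iP
        total = [t + K * q_i * c / dist**3 for t, c in zip(total, sep)]
    return total

# Hypothetical system: two equal positive charges placed symmetrically about the origin.
system = [(1e-9, (-0.1, 0.0)), (1e-9, (0.1, 0.0))]

E_mid = field_at((0.0, 0.0), system)   # midpoint between the charges
E_up = field_at((0.0, 0.1), system)    # point on the perpendicular bisector
print("E at midpoint:", E_mid)
print("E on bisector:", E_up)
```

At the midpoint the two contributions cancel, while on the perpendicular bisector the x-components cancel and the field points away from the pair.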


1.8.2 Physical significance of electric field 

You may wonder why the notion of electric field has been introduced here 
at all. After all, for any system of charges, the measurable quantity is the 
force on a charge which can be directly determined using Coulomb’s law 
and the superposition principle [Eq. (1.5)]. Why then introduce this 
intermediate quantity called the electric field? 






FIGURE 1.12 Electric field at a point due to a system of charges is the vector sum of the electric 

fields at the point due to individual charges. 

For electrostatics, the concept of electric field is convenient, but not really 
necessary. Electric field is an elegant way of characterising the electrical 
environment of a system of charges. Electric field at a point in the space 
around a system of charges tells you the force a unit positive test charge 
would experience if placed at that point (without disturbing the system). 
Electric field is a characteristic of the system of charges and is 
independent of the test charge that you place at a point to determine the 
field. The term field in physics generally refers to a quantity that is 
defined at every point in space and may vary from point to point. Electric 
field is a vector field, since force is a vector quantity. 

The true physical significance of the concept of electric field, however, 
emerges only when we go beyond electrostatics and deal with time- 
dependent electromagnetic phenomena. Suppose we consider the force 
between two distant charges $q_1$, $q_2$ in accelerated motion. Now the
greatest speed with which a signal or information can go from one point
to another is c, the speed of light. Thus, the effect of any motion of $q_1$ on
$q_2$ cannot arise instantaneously. There will be some time delay between
the effect (force on $q_2$) and the cause (motion of $q_1$). It is precisely here
that the notion of electric field (strictly, electromagnetic field) is natural
and very useful. The field picture is this: the accelerated motion of charge
$q_1$ produces electromagnetic waves, which then propagate with the
speed c, reach $q_2$ and cause a force on $q_2$. The notion of field elegantly

accounts for the time delay. Thus, even though electric and magnetic 
fields can be detected only by their effects (forces) on charges, they are 
regarded as physical entities, not merely mathematical constructs. They 
have an independent dynamics of their own, i.e., they evolve according to 
laws of their own. They can also transport energy. Thus, a source of time- 
dependent electromagnetic fields, turned on for a short interval of time 
and then switched off, leaves behind propagating electromagnetic fields 
transporting energy. The concept of field was first introduced by Faraday 
and is now among the central concepts in physics. 

Example 1.8 An electron falls through a distance of 1.5 cm in a
uniform electric field of magnitude $2.0 \times 10^{4}$ N C$^{-1}$ [Fig. 1.13(a)]. The
direction of the field is reversed keeping its magnitude unchanged
and a proton falls through the same distance [Fig. 1.13(b)]. Compute
the time of fall in each case. Contrast the situation with that of ‘free
fall under gravity’.

Figure 1.13





Solution In Fig. 1.13(a) the field is upward, so the negatively charged
electron experiences a downward force of magnitude eE where E is
the magnitude of the electric field. The acceleration of the electron is

$$a_e = eE/m_e$$

where $m_e$ is the mass of the electron.

Starting from rest, the time required by the electron to fall through a
distance h is given by

$$t_e = \sqrt{\frac{2h}{a_e}} = \sqrt{\frac{2h\,m_e}{eE}}$$

For $e = 1.6 \times 10^{-19}$ C, $m_e = 9.11 \times 10^{-31}$ kg,
$E = 2.0 \times 10^{4}$ N C$^{-1}$, $h = 1.5 \times 10^{-2}$ m,

$$t_e = 2.9 \times 10^{-9}\ \text{s}$$

In Fig. 1.13(b), the field is downward, and the positively charged
proton experiences a downward force of magnitude eE. The
acceleration of the proton is

$$a_p = eE/m_p$$

where $m_p$ is the mass of the proton; $m_p = 1.67 \times 10^{-27}$ kg. The time of
fall for the proton is

$$t_p = \sqrt{\frac{2h}{a_p}} = \sqrt{\frac{2h\,m_p}{eE}} = 1.3 \times 10^{-7}\ \text{s}$$

Thus, the heavier particle (proton) takes a greater time to fall through 
the same distance. This is in basic contrast to the situation of ‘free fall 
under gravity’ where the time of fall is independent of the mass of the 
body. Note that in this example we have ignored the acceleration due 












to gravity in calculating the time of fall. To see if this is justified, let us 
calculate the acceleration of the proton in the given electric field: 


$$a_p = \frac{eE}{m_p} = \frac{(1.6 \times 10^{-19}\ \text{C}) \times (2.0 \times 10^{4}\ \text{N C}^{-1})}{1.67 \times 10^{-27}\ \text{kg}} = 1.9 \times 10^{12}\ \text{m s}^{-2}$$

which is enormous compared to the value of g (9.8 m s$^{-2}$), the
acceleration due to gravity. The acceleration of the electron is even
greater. Thus, the effect of acceleration due to gravity can be ignored
in this example.
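The numbers in Example 1.8 can be reproduced with the short sketch below, which uses the values given in the solution. The function name `time_of_fall` is illustrative, and the comparison with free fall assumes g = 9.8 m s$^{-2}$ as quoted above.

```python
import math

E_CHARGE = 1.6e-19   # magnitude of the charge of electron and proton, in C
M_E = 9.11e-31       # electron mass in kg
M_P = 1.67e-27       # proton mass in kg
FIELD = 2.0e4        # magnitude of the uniform electric field, in N/C
HEIGHT = 1.5e-2      # distance fallen, in m
G_ACC = 9.8          # acceleration due to gravity, in m/s^2

def time_of_fall(mass, charge=E_CHARGE, field=FIELD, height=HEIGHT):
    """Time to cover `height` from rest under the constant acceleration a = qE/m."""
    acceleration = charge * field / mass
    return math.sqrt(2 * height / acceleration)

t_electron = time_of_fall(M_E)
t_proton = time_of_fall(M_P)
t_gravity = math.sqrt(2 * HEIGHT / G_ACC)   # free fall: independent of the mass
a_proton = E_CHARGE * FIELD / M_P

print(f"t_e = {t_electron:.2e} s, t_p = {t_proton:.2e} s, free fall = {t_gravity:.2e} s")
print(f"a_p = {a_proton:.2e} m/s^2")
```

The time of fall in the field scales as the square root of the mass, whereas the free-fall time does not involve the mass at all.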


Example 1.9 Two point charges $q_1$ and $q_2$, of magnitude $+10^{-8}$ C and
$-10^{-8}$ C, respectively, are placed 0.1 m apart. Calculate the electric
fields at points A, B and C shown in Fig. 1.14.

Figure 1.14

Solution The electric field vector $\mathbf{E}_{1A}$ at A due to the positive charge
$q_1$ points towards the right and has a magnitude

$$E_{1A} = \frac{(9 \times 10^{9}\ \text{N m}^2\,\text{C}^{-2}) \times (10^{-8}\ \text{C})}{(0.05\ \text{m})^2} = 3.6 \times 10^{4}\ \text{N C}^{-1}$$

The electric field vector $\mathbf{E}_{2A}$ at A due to the negative charge $q_2$ points
towards the right and has the same magnitude. Hence the magnitude
of the total electric field $E_A$ at A is

$$E_A = E_{1A} + E_{2A} = 7.2 \times 10^{4}\ \text{N C}^{-1}$$

$\mathbf{E}_A$ is directed toward the right.


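The electric field vector $\mathbf{E}_{1B}$ at B due to the positive charge $q_1$ points
towards the left and has a magnitude

$$E_{1B} = \frac{(9 \times 10^{9}\ \text{N m}^2\,\text{C}^{-2}) \times (10^{-8}\ \text{C})}{(0.05\ \text{m})^2} = 3.6 \times 10^{4}\ \text{N C}^{-1}$$

The electric field vector $\mathbf{E}_{2B}$ at B due to the negative charge $q_2$ points
towards the right and has a magnitude

$$E_{2B} = \frac{(9 \times 10^{9}\ \text{N m}^2\,\text{C}^{-2}) \times (10^{-8}\ \text{C})}{(0.15\ \text{m})^2} = 4 \times 10^{3}\ \text{N C}^{-1}$$

The magnitude of the total electric field at B is

$$E_B = E_{1B} - E_{2B} = 3.2 \times 10^{4}\ \text{N C}^{-1}$$

$\mathbf{E}_B$ is directed towards the left.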

The magnitude of each electric field vector at point C, due to charge
$q_1$ and $q_2$, is

$$E_{1C} = E_{2C} = \frac{(9 \times 10^{9}\ \text{N m}^2\,\text{C}^{-2}) \times (10^{-8}\ \text{C})}{(0.10\ \text{m})^2} = 9 \times 10^{3}\ \text{N C}^{-1}$$

The directions in which these two vectors point are indicated in
Fig. 1.14. The resultant of these two vectors is

$$E_C = E_{1C}\cos\frac{\pi}{3} + E_{2C}\cos\frac{\pi}{3} = 9 \times 10^{3}\ \text{N C}^{-1}$$

$\mathbf{E}_C$ points towards the right.
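The three results of Example 1.9 can be cross-checked by vector addition. Fig. 1.14 is not reproduced here, so the coordinates below are an assumption inferred from the distances used in the solution: $q_1$ at the origin, $q_2$ at 0.1 m along x, A midway between them, B 0.05 m to the left of $q_1$, and C at 0.1 m from both charges. The rounded constant $9 \times 10^{9}$ N m$^2$ C$^{-2}$ follows the solution.

```python
import math

K = 9e9  # rounded value of 1/(4*pi*eps0) used in the solution, in N m^2 C^-2

def field_at(point, charges):
    """Net electric field at `point` due to (q_i, r_i) pairs, by superposition."""
    total = [0.0, 0.0]
    for q_i, r_i in charges:
        sep = [p - c for p, c in zip(point, r_i)]
        dist = math.hypot(*sep)
        total = [t + K * q_i * c / dist**3 for t, c in zip(total, sep)]
    return total

# Assumed geometry (not taken from the figure itself).
charges = [(1e-8, (0.0, 0.0)), (-1e-8, (0.1, 0.0))]
A = (0.05, 0.0)
B = (-0.05, 0.0)
C = (0.05, 0.1 * math.sin(math.radians(60)))

E_A, E_B, E_C = field_at(A, charges), field_at(B, charges), field_at(C, charges)
print("E_A =", E_A)   # expected to point towards +x (right)
print("E_B =", E_B)   # expected to point towards -x (left)
print("E_C =", E_C)   # expected to point towards +x (right)
```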


1.9 Electric Field Lines 

We have studied electric field in the last section. It is a vector quantity 
and can be represented as we represent vectors. Let us try to represent 
E due to a point charge pictorially. Let the point charge be placed at the 
origin. Draw vectors pointing along the direction of the electric field with 
their lengths proportional to the strength of the field at each point. Since 
the magnitude of electric field at a point decreases inversely as the 
square of the distance of that point from the charge, the vector gets 
shorter as one goes away from the origin, always pointing radially 
outward. Figure 1.15 shows such a picture. In this figure, each arrow 
indicates the electric field, i.e., the force acting on a unit positive charge, 
placed at the tail of that arrow. Connect the arrows pointing in one 
direction and the resulting figure represents a field line. We thus get 
many field lines, all pointing outwards from the point charge. Have we 
lost the information about the strength or magnitude of the field now, 
because it was contained in the length of the arrow? No. Now the 
magnitude of the field is represented by the density of field lines. E is 
strong near the charge, so the density of field lines is more near the 




charge and the lines are closer. Away from the charge, the field gets 
weaker and the density of field lines is less, resulting in well-separated 
lines. 

Another person may draw more lines. But the number of lines is not 
important. In fact, an infinite number of lines can be drawn in any region. 
It is the relative density of lines in different regions which is important. 

We draw the figure on the plane of paper, i.e., in two-dimensions but we 
live in three-dimensions. So if one wishes to estimate the density of field 
lines, one has to consider the number of lines per unit cross-sectional 
area, perpendicular to the lines. Since the electric field decreases as the 
square of the distance from a point charge and the area enclosing the 
charge increases as the square of the distance, the number of field lines 
crossing the enclosing area remains constant, whatever may be the 
distance of the area from the charge. 



FIGURE 1.15 Field of a point charge. 


We started by saying that the field lines carry information about the 










direction of electric field at different points in space. Having drawn a 
certain set of field lines, the relative density (i.e., closeness) of the field 
lines at different points indicates the relative strength of electric field at 
those points. The field lines crowd where the field is strong and are 
spaced apart where it is weak. Figure 1.16 shows a set of field lines. We 
can imagine two equal and small elements of area placed at points R and 
S normal to the field lines there. The number of field lines in our picture 
cutting the area elements is proportional to the magnitude of field at 
these points. The picture shows that the field at R is stronger than at S. 

To understand the dependence of the field lines on the area, or rather the 
solid angle subtended by an area element, let us try to relate the area 
with the solid angle, a generalisation of angle to three dimensions. Recall 
how a (plane) angle is defined in two-dimensions. Let a small transverse
line element $\Delta l$ be placed at a distance r from a point O. Then the angle
subtended by $\Delta l$ at O can be approximated as $\Delta\theta = \Delta l/r$. Likewise, in
three-dimensions the solid angle* subtended by a small perpendicular
plane area $\Delta S$, at a distance r, can be written as
$\Delta\Omega = \Delta S/r^2$. We know that in a given solid angle the number of radial field
lines is the same. In Fig. 1.16, for two points $P_1$ and $P_2$ at distances $r_1$
and $r_2$ from the charge, the element of area subtending the solid angle
$\Delta\Omega$ is $r_1^2\,\Delta\Omega$ at $P_1$ and $r_2^2\,\Delta\Omega$ at $P_2$, respectively. The
number of lines (say n) cutting these area elements is the same. The
number of field lines cutting unit area element is therefore $n/(r_1^2\,\Delta\Omega)$ at
$P_1$ and $n/(r_2^2\,\Delta\Omega)$ at $P_2$, respectively. Since n and $\Delta\Omega$ are common, the
strength of the field clearly has a $1/r^2$ dependence.
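The counting argument above can be put into numbers. In the sketch below the line count n, the solid angle and the two distances are hypothetical; only the relation between area and solid angle, $\Delta S = r^2\,\Delta\Omega$, comes from the text.

```python
def line_density(n_lines, r, solid_angle):
    """Field lines per unit area on an element subtending `solid_angle` at distance r."""
    area = r**2 * solid_angle       # Delta S = r^2 * Delta Omega
    return n_lines / area

n = 12            # hypothetical number of lines inside the cone
d_omega = 0.01    # hypothetical solid angle in steradians
r1, r2 = 1.0, 2.0 # hypothetical distances of P1 and P2 from the charge, in m

density_1 = line_density(n, r1, d_omega)
density_2 = line_density(n, r2, d_omega)
print(density_1 / density_2)   # ratio of densities equals (r2 / r1)^2
```

Doubling the distance spreads the same n lines over four times the area, which is the $1/r^2$ dependence of the field strength.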



FIGURE 1.16 Dependence of electric field strength on the distance and its relation to the number
of field lines.

The picture of field lines was invented by Faraday to develop an intuitive 
non-mathematical way of visualising electric fields around charged 
configurations. Faraday called them lines of force. This term is somewhat 
misleading, especially in case of magnetic fields. The more appropriate 
term is field lines (electric or magnetic) that we have adopted in this book. 

Electric field lines are thus a way of pictorially mapping the electric field 
around a configuration of charges. An electric field line is, in general, a 
curve drawn in such a way that the tangent to it at each point is in the 
direction of the net field at that point. An arrow on the curve is obviously 
necessary to specify the direction of electric field from the two possible 
directions indicated by a tangent to the curve. A field line is a space 
curve, i.e., a curve in three dimensions. 

Figure 1.17 shows the field lines around some simple charge 
configurations. As mentioned earlier, the field lines are in 3-dimensional 
space, though the figure shows them only in a plane. The field lines of a 
single positive charge are radially outward while those of a single 


negative charge are radially inward. The field lines around a system of 
two positive charges (q, q) give a vivid pictorial description of their mutual 
repulsion, while those around the configuration of two equal and opposite 
charges 

(q, -q), a dipole, show clearly the mutual attraction between the charges. 
The field lines follow some important general properties: 

(i) Field lines start from positive charges and end at negative charges. If 
there is a single charge, they may start or end at infinity. 

(ii) In a charge-free region, electric field lines can be taken to be 
continuous curves without any breaks. 

(iii) Two field lines can never cross each other. (If they did, the field at the 
point of intersection will not have a unique direction, which is absurd.) 

(iv) Electrostatic field lines do not form any closed loops. This follows 
from the conservative nature of electric field (Chapter 2). 

* Solid angle is a measure of a cone. Consider the intersection of the
given cone with a sphere of radius R. The solid angle $\Delta\Omega$ of the cone is
defined to be equal to $\Delta S/R^2$, where $\Delta S$ is the area on the sphere cut out
by the cone.

Figure 1.17 Field lines due to some simple charge configurations.

1.10 Electric Flux 

Consider flow of a liquid with velocity v, through a small flat surface dS, in
a direction normal to the surface. The rate of flow of liquid is given by the
volume crossing the area per unit time, v dS, and represents the flux of
liquid flowing across the plane. If the normal to the surface is not parallel
to the direction of flow of liquid, i.e., to v, but makes an angle θ with it, the
projected area in a plane perpendicular to v is dS cos θ. Therefore, the
flux going out of the surface dS is $\mathbf{v} \cdot \hat{\mathbf{n}}\,dS$ = v dS cos θ. For the case of the electric
field, we define an analogous quantity and call it electric flux. We should,
however, note that there is no flow of a physically observable quantity
unlike the case of liquid flow.

In the picture of electric field lines described above, we saw that the
number of field lines crossing a unit area, placed normal to the field at a
point is a measure of the strength of electric field at that point. This
means that if we place a small planar element of area $\Delta S$ normal to E at
a point, the number of field lines crossing it is proportional* to $E\,\Delta S$. Now
suppose we tilt the area element by angle θ. Clearly, the number of field
lines crossing the area element will be smaller. The projection of the area
element normal to E is $\Delta S \cos\theta$. Thus, the number of field lines crossing
$\Delta S$ is proportional to $E\,\Delta S \cos\theta$. When θ = 90°, field lines will be parallel
to $\Delta S$ and will not cross it at all (Fig. 1.18).

FIGURE 1.18 Dependence of flux on the inclination θ between E and $\hat{\mathbf{n}}$.

The orientation of an area element, and not merely its magnitude, is
important in many contexts. For example, in a stream, the amount of water
flowing through a ring will naturally depend on how you hold the ring. If
you hold it normal to the flow, more water will flow through it than if you
hold it with any other orientation. This shows that an area element should
be treated as a vector. It has a magnitude and also a direction. How do we
specify the direction of a planar area? Clearly, the normal to the plane
specifies the orientation of the plane. Thus the direction of a planar area
vector is along its normal.

How do we associate a vector with the area of a curved surface? We imagine
dividing the surface into a large number of very small area elements.
Each small area element may be treated as planar and a vector
associated with it, as explained before.
Figure 1.19 Convention for defining normal n̂ and ΔS.

Notice one ambiguity here. The direction of an area element is along its
normal. But a normal can point in two directions. Which direction do we
choose as the direction of the vector associated with the area element?
This problem is resolved by some convention appropriate to the given
context. For the case of a closed surface, this convention is very simple.
The vector associated with every area element of a closed surface is
taken to be in the direction of the outward normal. This is the convention
used in Fig. 1.19. Thus, the area element vector ΔS at a point on a
closed surface equals ΔS n̂, where ΔS is the magnitude of the area
element and n̂ is a unit vector in the direction of the outward normal at
that point.

We now come to the definition of electric flux. Electric flux Δφ through an
area element ΔS is defined by

$$\Delta\phi = \mathbf{E} \cdot \Delta\mathbf{S} = E\,\Delta S \cos\theta$$ (1.11)

which, as seen before, is proportional to the number of field lines cutting
the area element. The angle θ here is the angle between E and ΔS. For a
closed surface, with the convention stated already, θ is the angle
between E and the outward normal to the area element. Notice we could
look at the expression E ΔS cos θ in two ways: E (ΔS cos θ), i.e., E times
the projection of area normal to E, or E⊥ ΔS, i.e., the component of E along
the normal to the area element times the magnitude of the area element.

* It will not be proper to say that the number of field lines is equal to
E ΔS. The number of field lines is, after all, a matter of how many field
lines we choose to draw. What is physically significant is the relative
number of field lines crossing a given area at different points.

The unit of electric flux is N C⁻¹ m².

The basic definition of electric flux given by Eq. (1.11) can be used, in
principle, to calculate the total flux through any given surface. All we have
to do is to divide the surface into small area elements, calculate the flux
at each element and add them up. Thus, the total flux φ through a
surface S is

$$\phi \approx \sum \mathbf{E} \cdot \Delta\mathbf{S}$$ (1.12)

The approximation sign is put because the electric field E is taken to be
constant over the small area element. This is mathematically exact only
when you take the limit ΔS → 0 and the sum in Eq. (1.12) is written as an
integral.
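The sum in Eq. (1.12) can be tried out numerically. The following is only a minimal sketch, assuming a flat parallelogram-shaped surface and a field supplied as a Python function. The names `flux_through_flat_surface` and `uniform_field`, and the field value of 100 N/C, are our own illustrative choices and do not come from the text.

```python
import math

def flux_through_flat_surface(field, corner, edge_u, edge_v, n=50):
    """Approximate the flux (sum of E . dS) through a flat parallelogram.

    field          : function (x, y, z) -> (Ex, Ey, Ez)
    corner         : one corner of the parallelogram
    edge_u, edge_v : edge vectors; the area vector points along edge_u x edge_v
    """
    # Edges of one small patch
    ux, uy, uz = (c / n for c in edge_u)
    vx, vy, vz = (c / n for c in edge_v)
    # Area vector of one small patch: (edge_u / n) x (edge_v / n)
    dS = (uy * vz - uz * vy, uz * vx - ux * vz, ux * vy - uy * vx)

    total = 0.0
    for i in range(n):
        for j in range(n):
            # Centre of patch (i, j); E is treated as constant over the patch
            s, t = i + 0.5, j + 0.5
            x = corner[0] + s * ux + t * vx
            y = corner[1] + s * uy + t * vy
            z = corner[2] + s * uz + t * vz
            Ex, Ey, Ez = field(x, y, z)
            total += Ex * dS[0] + Ey * dS[1] + Ez * dS[2]
    return total

def uniform_field(x, y, z):
    return (100.0, 0.0, 0.0)   # 100 N/C along +x (illustrative value)

# A 1 m x 1 m square whose normal makes an angle theta with the field
theta = math.radians(60)
edge_u = (0.0, 1.0, 0.0)
edge_v = (-math.sin(theta), 0.0, math.cos(theta))
phi = flux_through_flat_surface(uniform_field, (0.0, 0.0, 0.0), edge_u, edge_v)
print(phi)   # close to E * S * cos(theta) = 100 * 1 * 0.5
```

For a uniform field the sum reproduces E ΔS cos θ whatever the number of patches; for a field that varies over the surface, increasing `n` corresponds to taking smaller area elements.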



1.11 Electric Dipole 

An electric dipole is a pair of equal and opposite point charges q and -q, 
separated by a distance 2a. The line connecting the two charges defines 
a direction in space. By convention, the direction from -q to q is said to 
be the direction of the dipole. The mid-point of locations of -q and q is 
called the centre of the dipole. 

The total charge of the electric dipole is obviously zero. This does not
mean that the field of the electric dipole is zero. Since the charges q and
-q are separated by some distance, the electric fields due to them, when
added, do not exactly cancel out. However, at distances much larger than
the separation of the two charges forming a dipole (r ≫ 2a), the fields
due to q and -q nearly cancel out. The electric field due to a dipole
therefore falls off, at large distance, faster than 1/r² (the dependence
on r of the field due to a single charge q). These qualitative ideas are
borne out by the explicit calculation as follows:

1.11.1 The field of an electric dipole 

The electric field of the pair of charges (-q and q) at any point in space
can be found out from Coulomb’s law and the superposition principle.
The results are simple for the following two cases: (i) when the point is on
the dipole axis, and (ii) when it is in the equatorial plane of the dipole, i.e.,
on a plane perpendicular to the dipole axis through its centre. The electric
field at any general point P is obtained by adding the electric field
$\mathbf{E}_{-q}$ due to the charge -q and the electric field
$\mathbf{E}_{+q}$ due to the charge q, by the parallelogram law of vectors.

(i) For points on the axis 

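Let the point P be at distance r from the centre of the dipole on the side
of the charge q, as shown in Fig. 1.20(a). Then

$$\mathbf{E}_{-q} = -\frac{q}{4\pi\varepsilon_0 (r+a)^2}\,\hat{\mathbf{p}}$$ [1.13(a)]

where p̂ is the unit vector along the dipole axis (from -q to q). Also

$$\mathbf{E}_{+q} = \frac{q}{4\pi\varepsilon_0 (r-a)^2}\,\hat{\mathbf{p}}$$ [1.13(b)]

The total field at P is

$$\mathbf{E} = \mathbf{E}_{+q} + \mathbf{E}_{-q} = \frac{q}{4\pi\varepsilon_0}\left[\frac{1}{(r-a)^2} - \frac{1}{(r+a)^2}\right]\hat{\mathbf{p}} = \frac{q}{4\pi\varepsilon_0}\,\frac{4ar}{(r^2-a^2)^2}\,\hat{\mathbf{p}}$$ (1.14)

For r ≫ a,

$$\mathbf{E} = \frac{4qa}{4\pi\varepsilon_0 r^3}\,\hat{\mathbf{p}} \quad (r \gg a)$$ (1.15)

Figure 1.20 Electric field of a dipole at (a) a point on the axis, (b) a point on the equatorial plane of
the dipole. p is the dipole moment vector of magnitude p = q × 2a and directed from -q to q.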


(ii) For points on the equatorial plane 


The magnitudes of the electric fields due to the two charges +q and -q
are given by

$$E_{+q} = \frac{q}{4\pi\varepsilon_0}\,\frac{1}{r^2 + a^2}$$ [1.16(a)]

$$E_{-q} = \frac{q}{4\pi\varepsilon_0}\,\frac{1}{r^2 + a^2}$$ [1.16(b)]

and are equal.

















The directions of $\mathbf{E}_{+q}$ and $\mathbf{E}_{-q}$ are as shown in
Fig. 1.20(b). Clearly, the components normal to the dipole axis cancel
away. The components along the dipole axis add up. The total electric
field is opposite to p̂. We have

$$\mathbf{E} = -(E_{+q} + E_{-q}) \cos\theta\;\hat{\mathbf{p}} = -\frac{2qa}{4\pi\varepsilon_0 (r^2 + a^2)^{3/2}}\,\hat{\mathbf{p}}$$ (1.17)

At large distances (r ≫ a), this reduces to

$$\mathbf{E} = -\frac{2qa}{4\pi\varepsilon_0 r^3}\,\hat{\mathbf{p}} \quad (r \gg a)$$ (1.18)

From Eqs. (1.15) and (1.18), it is clear that the dipole field at large 
distances does not involve q and a separately; it depends on the product 
qa. This suggests the definition of dipole moment. The dipole moment 
vector p of an electric dipole is defined by 
$$\mathbf{p} = q \times 2a\,\hat{\mathbf{p}}$$ (1.19)

that is, it is a vector whose magnitude is charge q times the separation 2a 
(between the pair of charges q, -q) and the direction is along the line 
from -q to q. In terms of p, the electric field of a dipole at large distances 
takes simple forms: 

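At a point on the dipole axis

$$\mathbf{E} = \frac{2\mathbf{p}}{4\pi\varepsilon_0 r^3} \quad (r \gg a)$$ (1.20)

At a point on the equatorial plane

$$\mathbf{E} = -\frac{\mathbf{p}}{4\pi\varepsilon_0 r^3} \quad (r \gg a)$$ (1.21)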



Notice the important point that the dipole field at large distances falls off
not as 1/r² but as 1/r³. Further, the magnitude and the direction of the
dipole field depend not only on the distance r but also on the angle
between the position vector r and the dipole moment p.

We can think of the limit in which the dipole size 2a approaches zero and
the charge q approaches infinity in such a way that the product
p = q × 2a is finite. Such a dipole is referred to as a point dipole. For a
point dipole, Eqs. (1.20) and (1.21) are exact, true for any r.

1.11.2 Physical significance of dipoles 

In most molecules, the centres of positive charges and of negative
charges* lie at the same place. Therefore, their dipole moment is zero.
CO₂ and CH₄ are molecules of this type. However, they develop a
dipole moment when an electric field is applied. But in some molecules,
the centres of negative charges and of positive charges do not coincide.
Therefore they have a permanent electric dipole moment, even in the
absence of an electric field. Such molecules are called polar molecules.
The water molecule, H₂O, is an example of this type. Various materials give
rise to interesting properties and important applications in the presence
or absence of electric field.

Example 1.10 Two charges ±10 μC are placed 5.0 mm apart.
Determine the electric field at (a) a point P on the axis of the dipole
15 cm away from its centre O on the side of the positive charge, as
shown in Fig. 1.21(a), and (b) a point Q, 15 cm away from O on a line
passing through O and normal to the axis of the dipole, as shown in
Fig. 1.21(b).
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FIGURE 1.21 

Solution (a) Field at P due to charge +10 μC

$$= \frac{10^{-5}\ \mathrm{C}}{4\pi\,(8.854 \times 10^{-12}\ \mathrm{C^2\,N^{-1}\,m^{-2}})} \times \frac{1}{(15 - 0.25)^2 \times 10^{-4}\ \mathrm{m^2}}$$

= 4.13 × 10⁶ N C⁻¹ along BP

Field at P due to charge -10 μC

$$= \frac{10^{-5}\ \mathrm{C}}{4\pi\,(8.854 \times 10^{-12}\ \mathrm{C^2\,N^{-1}\,m^{-2}})} \times \frac{1}{(15 + 0.25)^2 \times 10^{-4}\ \mathrm{m^2}}$$

= 3.86 × 10⁶ N C⁻¹ along PA

The resultant electric field at P due to the two charges at A and B is
= 2.7 × 10⁵ N C⁻¹ along BP.

In this example, the ratio OP/OB is quite large (= 60). Thus, we can
expect to get approximately the same result as above by directly
using the formula for electric field at a far-away point on the axis of a
dipole. For a dipole consisting of charges ±q, 2a distance apart, the
electric field at a distance r from the centre on the axis of the dipole
has a magnitude

$$E = \frac{2p}{4\pi\varepsilon_0 r^3} \quad (r/a \gg 1)$$

where p = 2a q is the magnitude of the dipole moment.

The direction of electric field on the dipole axis is always along the
direction of the dipole moment vector (i.e., from -q to q). Here,

p = 10⁻⁵ C × 5 × 10⁻³ m = 5 × 10⁻⁸ C m

Therefore,

$$E = \frac{2 \times 5 \times 10^{-8}\ \mathrm{C\,m}}{4\pi\,(8.854 \times 10^{-12}\ \mathrm{C^2\,N^{-1}\,m^{-2}})} \times \frac{1}{(15)^3 \times 10^{-6}\ \mathrm{m^3}} = 2.6 \times 10^{5}\ \mathrm{N\,C^{-1}}$$

along the dipole moment direction AB, which is close to the result
obtained earlier.

(b) Field at Q due to charge +10 μC at B

$$= \frac{10^{-5}\ \mathrm{C}}{4\pi\,(8.854 \times 10^{-12}\ \mathrm{C^2\,N^{-1}\,m^{-2}})} \times \frac{1}{[15^2 + (0.25)^2] \times 10^{-4}\ \mathrm{m^2}}$$

= 3.99 × 10⁶ N C⁻¹ along BQ

Field at Q due to charge -10 μC at A

$$= \frac{10^{-5}\ \mathrm{C}}{4\pi\,(8.854 \times 10^{-12}\ \mathrm{C^2\,N^{-1}\,m^{-2}})} \times \frac{1}{[15^2 + (0.25)^2] \times 10^{-4}\ \mathrm{m^2}}$$

= 3.99 × 10⁶ N C⁻¹ along QA.

Clearly, the components of these two fields with equal magnitudes
cancel along the direction OQ but add up along the direction parallel
to BA. Therefore, the resultant electric field at Q due to the two
charges at A and B is

$$= 2 \times \frac{0.25}{\sqrt{15^2 + (0.25)^2}} \times 3.99 \times 10^{6}\ \mathrm{N\,C^{-1}}\ \text{along BA}$$

= 1.33 × 10⁵ N C⁻¹ along BA.

As in (a), we can expect to get approximately the same result by
directly using the formula for dipole field at a point on the normal to
the axis of the dipole:

$$E = \frac{p}{4\pi\varepsilon_0 r^3} \quad (r/a \gg 1)$$

$$= \frac{5 \times 10^{-8}\ \mathrm{C\,m}}{4\pi\,(8.854 \times 10^{-12}\ \mathrm{C^2\,N^{-1}\,m^{-2}})} \times \frac{1}{(15)^3 \times 10^{-6}\ \mathrm{m^3}}$$

= 1.33 × 10⁵ N C⁻¹.

The direction of electric field in this case is opposite to the direction of
the dipole moment vector. Again, the result agrees with that obtained
before.
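The numbers in Example 1.10 can be cross-checked by adding the Coulomb fields of the two charges directly, without using the dipole formulas at all. The sketch below is one possible way to do this, assuming the dipole lies along the x-axis with -q at A = (-a, 0) and +q at B = (+a, 0); the function name `field_at` and this choice of coordinates are ours, not the text’s.

```python
import math

K = 1 / (4 * math.pi * 8.854e-12)   # 1/(4 pi eps0) in N m^2 C^-2

def field_at(point, charges):
    """Superpose Coulomb fields; charges is a list of (q, (x, y)) pairs."""
    Ex = Ey = 0.0
    for q, (cx, cy) in charges:
        dx, dy = point[0] - cx, point[1] - cy
        r = math.hypot(dx, dy)
        Ex += K * q * dx / r**3
        Ey += K * q * dy / r**3
    return Ex, Ey

a = 2.5e-3     # half of the 5.0 mm separation, in m
q = 10e-6      # 10 microcoulomb, in C
r = 0.15       # distance of P and Q from the centre O, in m
dipole = [(-q, (-a, 0.0)), (+q, (+a, 0.0))]
p = q * 2 * a  # magnitude of the dipole moment, in C m

E_axis = field_at((r, 0.0), dipole)      # point P on the axis
E_equator = field_at((0.0, r), dipole)   # point Q on the equatorial line

print(E_axis[0], 2 * K * p / r**3)       # exact sum vs far-field formula (1.20)
print(E_equator[0], -K * p / r**3)       # exact sum vs far-field formula (1.21)
```

With these coordinates a positive x-component means "along BP" and a negative one means "along BA", so the signs as well as the magnitudes can be compared with the worked solution.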


* Centre of a collection of positive point charges is defined much the
same way as the centre of mass:

$$\mathbf{r}_{\mathrm{cm}} = \frac{\sum_i q_i \mathbf{r}_i}{\sum_i q_i}$$




1.12 Dipole in a Uniform External Field

Consider a permanent dipole of dipole moment p in a uniform external 
field E, as shown in Fig. 1.22. (By permanent dipole, we mean that p 
exists irrespective of E; it has not been induced by E.) 








There is a force qE on q and a force -qE on -q. The net force on the 
dipole is zero, since E is uniform. However, the charges are separated, 
so the forces act at different points, resulting in a torque on the dipole. 
When the net force is zero, the torque (couple) is independent of the 
origin. Its magnitude equals the magnitude of each force multiplied by the 
arm of the couple (perpendicular distance between the two antiparallel 
forces). 

Magnitude of torque = q E × 2a sin θ
= 2 q a E sin θ

Its direction is normal to the plane of the paper, coming out of it.

The magnitude of p × E is also p E sin θ and its direction is normal to the
paper, coming out of it. Thus,

$$\boldsymbol{\tau} = \mathbf{p} \times \mathbf{E}$$ (1.22)

This torque will tend to align the dipole with the field E. When p is aligned 
with E, the torque is zero. 
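Both statements made above, that the torque of the couple does not depend on the choice of origin and that it equals p × E, can be checked with a few lines of code. This is only an illustrative sketch: the helper names `cross` and `torque_about`, the numerical values of q, a and E, and the 30° angle are all assumptions made for the example.

```python
import math

def cross(u, v):
    """Vector product u x v of two 3-component tuples."""
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])

def torque_about(origin, charges, E):
    """Sum r x F over all charges, with r measured from `origin` and F = qE."""
    total = [0.0, 0.0, 0.0]
    for q, position in charges:
        r = tuple(position[i] - origin[i] for i in range(3))
        F = tuple(q * E[i] for i in range(3))
        t = cross(r, F)
        for i in range(3):
            total[i] += t[i]
    return tuple(total)

q, a = 1e-6, 1e-3                      # illustrative charge (C) and half-separation (m)
theta = math.radians(30)               # angle between p and E
p_hat = (math.cos(theta), math.sin(theta), 0.0)
E = (1e4, 0.0, 0.0)                    # uniform field along +x, in N/C

charges = [(+q, tuple(+a * c for c in p_hat)),
           (-q, tuple(-a * c for c in p_hat))]
p = tuple(q * 2 * a * c for c in p_hat)

tau_centre = torque_about((0.0, 0.0, 0.0), charges, E)
tau_shifted = torque_about((5.0, -3.0, 2.0), charges, E)
tau_formula = cross(p, E)
print(tau_centre, tau_shifted, tau_formula)
```

The three results agree, and the magnitude equals p E sin θ = 2 q a E sin θ for the values chosen.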

What happens if the field is not uniform? In that case, the net force will 
evidently be non-zero. In addition there will, in general, be a torque on 
the system as before. The general case is involved, so let us consider the 
simpler situations when p is parallel to E or antiparallel to E. In either 
case, the net torque is zero, but there is a net force on the dipole if E is 
not uniform. 

Figure 1.23 is self-explanatory. It is easily seen that when p is parallel to 
E, the dipole has a net force in the direction of increasing field. When p is 
antiparallel to E, the net force on the dipole is in the direction of 
decreasing field. In general, the force depends on the orientation of p 
with respect to E. 
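The direction of the net force in a non-uniform field can also be seen in a small calculation. As a sketch, assume the non-uniform field is that of a positive point charge Q at the origin, so that on the +x axis the field points along +x and grows stronger towards the origin. The charge Q, the function names and the numerical values are hypothetical choices for illustration.

```python
import math

K = 1 / (4 * math.pi * 8.854e-12)   # 1/(4 pi eps0) in N m^2 C^-2

def E_x(x, Q=1e-6):
    """Field on the +x axis of a point charge Q at the origin (along +x for Q > 0)."""
    return K * Q / x**2

def net_force_on_dipole(x_centre, q, a, orientation):
    """Net x-force on a dipole lying on the x-axis.

    orientation = +1: p along +x (parallel to E); -1: p along -x (antiparallel).
    """
    x_plus = x_centre + orientation * a    # position of +q
    x_minus = x_centre - orientation * a   # position of -q
    return q * E_x(x_plus) - q * E_x(x_minus)

F_parallel = net_force_on_dipole(0.10, 1e-9, 1e-3, +1)
F_antiparallel = net_force_on_dipole(0.10, 1e-9, 1e-3, -1)
print(F_parallel, F_antiparallel)
```

A negative force here points towards the origin, where the field is stronger, and a positive force points away from it, where the field is weaker.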
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FIGURE 1.22 Dipole in a uniform electric field. 
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FIGURE 1.23 Electric force on a dipole: (a) E parallel to p, (b) E antiparallel to p. 


This brings us to a common observation in frictional electricity. A comb
run through dry hair attracts pieces of paper. The comb, as we know,
acquires charge through friction. But the paper is not charged. What then
explains the attractive force? Taking the clue from the preceding
discussion, the charged comb ‘polarises’ the piece of paper, i.e., induces
a net dipole moment in the direction of field. Further, the electric field due
to the comb is not uniform. In this situation, it is easily seen that the paper
should move in the direction of the comb!

1.13 Continuous Charge Distribution 

We have so far dealt with charge configurations involving discrete
charges q₁, q₂, ..., qₙ. One reason why we restricted to discrete charges
is that the mathematical treatment is simpler and does not involve
calculus. For many purposes, however, it is impractical to work in terms
of discrete charges and we need to work with continuous charge
distributions. For example, on the surface of a charged conductor, it is
impractical to specify the charge distribution in terms of the locations of
the microscopic charged constituents. It is more feasible to consider an
area element ΔS (Fig. 1.24) on the surface of the conductor (which is
very small on the macroscopic scale but big enough to include a very
large number of electrons) and specify the charge ΔQ on that element.
We then define a surface charge density σ at the area element by

$$\sigma = \frac{\Delta Q}{\Delta S}$$ (1.23)

We can do this at different points on the conductor and thus arrive at a
continuous function σ, called the surface charge density. The surface
charge density σ so defined ignores the quantisation of charge and the
discontinuity in charge distribution at the microscopic level*. σ represents
macroscopic surface charge density, which in a sense, is a smoothed out
average of the microscopic charge density over an area element ΔS
which, as said before, is large microscopically but small macroscopically.



The units for σ are C/m².

Similar considerations apply for a line charge distribution and a volume
charge distribution. The linear charge density λ of a wire is defined by

$$\lambda = \frac{\Delta Q}{\Delta l}$$ (1.24)

where Δl is a small line element of wire on the macroscopic scale that,
however, includes a large number of microscopic charged constituents,
and ΔQ is the charge contained in that line element. The units for λ are
C/m. The volume charge density (sometimes simply called charge
density) is defined in a similar manner:

$$\rho = \frac{\Delta Q}{\Delta V}$$ (1.25)

where ΔQ is the charge included in the macroscopically small volume
element ΔV that includes a large number of microscopic charged
constituents. The units for ρ are C/m³.

The notion of continuous charge distribution is similar to that we adopt for 
continuous mass distribution in mechanics. When we refer to the density 
of a liquid, we are referring to its macroscopic density. We regard it as a 
continuous fluid and ignore its discrete molecular constitution. 
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Figure 1.24 Definition of linear, surface and volume charge densities. In each case, the element 
(Δl, ΔS, ΔV) chosen is small on the macroscopic scale but contains a very large number of
microscopic constituents.

The field due to a continuous charge distribution can be obtained in much
the same way as for a system of discrete charges, Eq. (1.10). Suppose a
continuous charge distribution in space has a charge density ρ. Choose
any convenient origin O and let the position vector of any point in the
charge distribution be r. The charge density ρ may vary from point to
point, i.e., it is a function of r. Divide the charge distribution into small
volume elements of size ΔV. The charge in a volume element ΔV is ρΔV.

Now, consider any general point P (inside or outside the distribution) with
position vector R (Fig. 1.24). Electric field due to the charge ρΔV is given
by Coulomb’s law:

$$\Delta\mathbf{E} = \frac{1}{4\pi\varepsilon_0}\,\frac{\rho\,\Delta V}{r'^2}\,\hat{\mathbf{r}}'$$ (1.26)

where r′ is the distance between the charge element and P, and r̂′ is a
unit vector in the direction from the charge element to P. By the
superposition principle, the total electric field due to the charge
distribution is obtained by summing over electric fields due to different
volume elements:

$$\mathbf{E} \cong \frac{1}{4\pi\varepsilon_0} \sum_{\text{all } \Delta V} \frac{\rho\,\Delta V}{r'^2}\,\hat{\mathbf{r}}'$$ (1.27)

Note that ρ, r′ and r̂′ can all vary from point to point. In a strict
mathematical method, we should let ΔV → 0 and the sum then becomes an
integral; but we omit that discussion here, for simplicity. In short, using
Coulomb’s law and the superposition principle, electric field can be
determined for any charge distribution, discrete or continuous or part
discrete and part continuous.
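The sum over volume elements described above is easy to carry out on a computer. The following sketch is an illustration only: it assumes a ball of radius R filled with uniform charge density ρ, chops it into small cubes of side h, treats each cube as a point charge ρΔV at its centre, and adds the Coulomb fields at an outside point P. The function name and all numerical values are our own assumptions. For comparison we use the field of a single point charge, equal to the total charge, placed at the centre, which is what the spherical symmetry leads us to expect at an outside point.

```python
import math

EPS0 = 8.854e-12   # permittivity of free space, C^2 N^-1 m^-2

def field_of_uniform_ball(rho, R, P, n=20):
    """Sum the fields of small cubic volume elements filling a ball of radius R.

    Returns (field vector at P, total charge included in the sum).
    """
    h = 2 * R / n                                    # side of each small cube
    centres = [-R + (i + 0.5) * h for i in range(n)]
    dq = rho * h**3                                  # charge rho * dV of one element
    Ex = Ey = Ez = 0.0
    total_charge = 0.0
    for x in centres:
        for y in centres:
            for z in centres:
                if x * x + y * y + z * z > R * R:
                    continue                         # element lies outside the ball
                dx, dy, dz = P[0] - x, P[1] - y, P[2] - z
                d = math.sqrt(dx * dx + dy * dy + dz * dz)
                f = dq / (4 * math.pi * EPS0 * d**3)
                Ex += f * dx
                Ey += f * dy
                Ez += f * dz
                total_charge += dq
    return (Ex, Ey, Ez), total_charge

rho, R = 1e-6, 0.05                                  # illustrative values (C/m^3, m)
E_sum, Q = field_of_uniform_ball(rho, R, (3 * R, 0.0, 0.0))
E_point = Q / (4 * math.pi * EPS0 * (3 * R)**2)      # point charge Q at the centre
print(E_sum[0], E_point)
```

Making `n` larger corresponds to taking smaller volume elements ΔV, at the cost of a longer sum.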


* At the microscopic level, charge distribution is discontinuous, because
there are discrete charges separated by intervening space where there is
no charge.


1.14 Gauss’s Law 


As a simple application of the notion of electric flux, let us consider the 
total flux through a sphere of radius r, which encloses a point charge q at 
its centre. Divide the sphere into small area elements, as shown in 
Fig. 1.25. 

The flux through an area element ΔS is

$$\Delta\phi = \mathbf{E} \cdot \Delta\mathbf{S} = \frac{q}{4\pi\varepsilon_0 r^2}\,\hat{\mathbf{r}} \cdot \Delta\mathbf{S}$$ (1.28)

where we have used Coulomb’s law for the electric field due to a single
charge q. The unit vector r̂ is along the radius vector from the centre to
the area element. Now, since the normal to a sphere at every point is
along the radius vector at that point, the area element ΔS and r̂ have the
same direction. Therefore,

$$\Delta\phi = \frac{q}{4\pi\varepsilon_0 r^2}\,\Delta S$$ (1.29)

since the magnitude of a unit vector is 1.

Figure 1.25 Flux through a sphere enclosing a point charge q at its centre.

The total flux through the sphere is obtained by adding up flux through all
the different area elements:

$$\phi = \sum_{\text{all } \Delta S} \frac{q}{4\pi\varepsilon_0 r^2}\,\Delta S$$

Since each area element of the sphere is at the same distance r from the
charge,

$$\phi = \frac{q}{4\pi\varepsilon_0 r^2} \sum_{\text{all } \Delta S} \Delta S = \frac{q}{4\pi\varepsilon_0 r^2}\,S$$

Now S, the total area of the sphere, equals 4πr². Thus,

$$\phi = \frac{q}{4\pi\varepsilon_0 r^2} \times 4\pi r^2 = \frac{q}{\varepsilon_0}$$ (1.30)

Equation (1.30) is a simple illustration of a general result of electrostatics
called Gauss’s law.

We state Gauss’s law without proof: 

Electric flux through a closed surface S

= q/ε₀ (1.31)

q = total charge enclosed by S.

The law implies that the total electric flux through a closed surface is zero 
if no charge is enclosed by the surface. We can see that explicitly in the 
simple situation of Fig. 1.26. 

Here the electric field is uniform and we are considering a closed
cylindrical surface, with its axis parallel to the uniform field E. The total
flux φ through the surface is φ = φ₁ + φ₂ + φ₃, where φ₁ and φ₂ represent
the flux through the surfaces 1 and 2 (of circular cross-section) of the
cylinder and φ₃ is the flux through the curved cylindrical part of the closed
surface. Now the normal to the surface 3 at every point is perpendicular
to E, so by definition of flux, φ₃ = 0. Further, the outward normal to 2 is
along E while the outward normal to 1 is opposite to E. Therefore,

φ₁ = -E S₁, φ₂ = +E S₂

S₁ = S₂ = S

where S is the area of circular cross-section. Thus, the total flux is zero,
as expected by Gauss’s law. Thus, whenever we find that the net electric
flux through a closed surface is zero, we conclude that the total charge
contained in the closed surface is zero.



Figure 1.26 Calculation of the flux of uniform electric field through the 
surface of a cylinder. 




The great significance of Gauss’s law, Eq. (1.31), is that it is true in
general, and not only for the simple cases we have considered above.
Let us note some important points regarding this law:

(i) Gauss’s law is true for any closed surface, no matter what its shape or 
size. 

(ii) The term q on the right side of Gauss’s law, Eq. (1.31), includes the 
sum of all charges enclosed by the surface. The charges may be located 
anywhere inside the surface. 

(iii) In the situation when the surface is so chosen that there are some 
charges inside and some outside, the electric field [whose flux appears 
on the left side of Eq. (1.31)] is due to all the charges, both inside and 
outside S. The term q on the right side of Gauss’s law, however, 
represents only the total charge inside S. 

(iv) The surface that we choose for the application of Gauss’s law is 
called the Gaussian surface. You may choose any Gaussian surface and 
apply Gauss’s law. However, take care not to let the Gaussian surface 
pass through any discrete charge. This is because electric field due to a 
system of discrete charges is not well defined at the location of any 
charge. (As you go close to the charge, the field grows without any 
bound.) However, the Gaussian surface can pass through a continuous 
charge distribution. 







(v) Gauss’s law is often useful for a much easier calculation of the
electrostatic field when the system has some symmetry. This is facilitated
by the choice of a suitable Gaussian surface.

(vi) Finally, Gauss’s law is based on the inverse square dependence on
distance contained in Coulomb’s law. Any violation of Gauss’s law will
indicate departure from the inverse square law.
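Points (i) to (iii) can be illustrated numerically. The sketch below is only an illustration under our own assumptions: the closed surface is a cube of side 2 m centred at the origin, each face is divided into small squares, and the flux of the field of a single point charge is summed over all six faces, using the outward normal on each. The function name `flux_through_cube` and the charge positions are hypothetical choices.

```python
import math

EPS0 = 8.854e-12   # permittivity of free space, C^2 N^-1 m^-2

def flux_through_cube(q, charge_pos, half_side=1.0, n=100):
    """Sum E . dS over the six faces of a cube centred at the origin.

    E is the Coulomb field of a point charge q located at charge_pos.
    """
    h = 2 * half_side / n
    centres = [-half_side + (i + 0.5) * h for i in range(n)]
    total = 0.0
    for axis in range(3):                 # face normal along x, y or z
        for sign in (-1.0, 1.0):          # outward normal is sign * unit vector
            for u in centres:
                for v in centres:
                    point = [0.0, 0.0, 0.0]
                    point[axis] = sign * half_side
                    point[(axis + 1) % 3] = u
                    point[(axis + 2) % 3] = v
                    sep = [point[i] - charge_pos[i] for i in range(3)]
                    d = math.sqrt(sum(c * c for c in sep))
                    # Component of E along the outward normal of this face
                    E_normal = sign * q * sep[axis] / (4 * math.pi * EPS0 * d**3)
                    total += E_normal * h * h
    return total

q = 1e-9
flux_inside = flux_through_cube(q, (0.3, -0.2, 0.1))    # charge inside, off-centre
flux_outside = flux_through_cube(q, (2.5, 0.0, 0.0))    # charge outside the cube
print(flux_inside, q / EPS0, flux_outside)
```

The first flux comes out close to q/ε₀ even though the charge is not at the centre and the surface is not a sphere; the second comes out close to zero, although the field of the outside charge is certainly not zero on the surface.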

Example 1.11 The electric field components in Fig. 1.27 are
E_x = αx^{1/2}, E_y = E_z = 0, in which α = 800 N/C m^{1/2}. Calculate (a) the
flux through the cube, and (b) the charge within the cube. Assume
that a = 0.1 m.



Figure 1.27 

Solution 

(a) Since the electric field has only an x component, for the four faces
parallel to the x direction (whose normals are perpendicular to it), the
angle between E and ΔS is ±π/2. Therefore, the flux φ = E·ΔS is separately
zero for each face of the cube except the two shaded ones. Now the
magnitude of the electric field at the left face is

$$E_L = \alpha x^{1/2} = \alpha a^{1/2}$$

(x = a at the left face).

The magnitude of electric field at the right face is

$$E_R = \alpha x^{1/2} = \alpha (2a)^{1/2}$$

(x = 2a at the right face).

The corresponding fluxes are

$$\phi_L = \mathbf{E}_L \cdot \Delta\mathbf{S} = \Delta S\; \mathbf{E}_L \cdot \hat{\mathbf{n}}_L = E_L\,\Delta S \cos\theta = -E_L\,\Delta S, \quad \text{since } \theta = 180^\circ$$

$$= -E_L a^2$$

$$\phi_R = \mathbf{E}_R \cdot \Delta\mathbf{S} = E_R\,\Delta S \cos\theta = E_R\,\Delta S, \quad \text{since } \theta = 0^\circ$$

$$= E_R a^2$$

Net flux through the cube

$$= \phi_R + \phi_L = E_R a^2 - E_L a^2 = a^2 (E_R - E_L) = \alpha a^2 \left[(2a)^{1/2} - a^{1/2}\right]$$

$$= \alpha a^{5/2} \left(\sqrt{2} - 1\right)$$

$$= 800\,(0.1)^{5/2} \left(\sqrt{2} - 1\right)$$

= 1.05 N m² C⁻¹

(b) We can use Gauss’s law to find the total charge q inside the cube.
We have φ = q/ε₀ or q = φε₀. Therefore,

q = 1.05 × 8.854 × 10⁻¹² C = 9.27 × 10⁻¹² C.
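The arithmetic of Example 1.11 can be reproduced in a few lines. This is just a check of the numbers, with variable names of our own choosing; it assumes, as in the solution, that the two faces normal to x are at x = a and x = 2a.

```python
import math

EPS0 = 8.854e-12   # permittivity of free space, C^2 N^-1 m^-2
alpha = 800.0      # N C^-1 m^-1/2
a = 0.1            # side of the cube, in m

def E_x(x):
    """x-component of the field, E_x = alpha * x**(1/2)."""
    return alpha * math.sqrt(x)

flux_left = -E_x(a) * a**2        # outward normal is opposite to E on the left face
flux_right = E_x(2 * a) * a**2    # outward normal is along E on the right face
net_flux = flux_left + flux_right
charge = EPS0 * net_flux          # Gauss's law: q = eps0 * flux

print(net_flux)   # about 1.05 N m^2 C^-1
print(charge)     # about 9.27e-12 C
```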




Example 1.12 An electric field is uniform, and in the positive x
direction for positive x, and uniform with the same magnitude but in
the negative x direction for negative x. It is given that E = 200 î N/C
for x > 0 and E = -200 î N/C for x < 0. A right circular cylinder of
length 20 cm and radius 5 cm has its centre at the origin and its axis
along the x-axis so that one face is at x = +10 cm and the other is at
x = -10 cm (Fig. 1.28). (a) What is the net outward flux through each
flat face? (b) What is the flux through the side of the cylinder?
(c) What is the net outward flux through the cylinder? (d) What is the
net charge inside the cylinder?

Solution 

(a) We can see from the figure that on the left face E and ΔS are
parallel. Therefore, the outward flux is

φ_L = E·ΔS = -200 î·ΔS

= +200 ΔS, since î·ΔS = -ΔS

= +200 × π (0.05)² = +1.57 N m² C⁻¹

On the right face, E and ΔS are parallel and therefore

φ_R = E·ΔS = +1.57 N m² C⁻¹.

(b) For any point on the side of the cylinder E is perpendicular to ΔS
and hence E·ΔS = 0. Therefore, the flux out of the side of the cylinder
is zero.

(c) Net outward flux through the cylinder
φ = 1.57 + 1.57 + 0 = 3.14 N m² C⁻¹
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Figure 1.28 


(d) The net charge within the cylinder can be found by using Gauss’s
law which gives

q = ε₀φ

= 3.14 × 8.854 × 10⁻¹² C
= 2.78 × 10⁻¹¹ C


1.15 Applications of Gauss’s Law

The electric field due to a general charge distribution is, as seen above,
given by Eq. (1.27). In practice, except for some special cases, the
summation (or integration) involved in this equation cannot be carried out
to give electric field at every point in space. For some symmetric charge
configurations, however, it is possible to obtain the electric field in a
simple way using Gauss’s law. This is best understood by some examples.

1.15.1 Field due to an infinitely long straight uniformly charged wire








Consider an infinitely long thin straight wire with uniform linear charge
density λ. The wire is obviously an axis of symmetry. Suppose we take
the radial vector from O to P and rotate it around the wire. The points P,
P′, P″ so obtained are completely equivalent with respect to the charged
wire. This implies that the electric field must have the same magnitude at
these points. The direction of electric field at every point must be radial
(outward if λ > 0, inward if λ < 0). This is clear from Fig. 1.29.

Consider a pair of line elements P₁ and P₂ of the wire, as shown. The
electric fields produced by the two elements of the pair when summed
give a resultant electric field which is radial (the components normal to
the radial vector cancel). This is true for any such pair and hence the total
field at any point P is radial. Finally, since the wire is infinite, electric field
does not depend on the position of P along the length of the wire. In
short, the electric field is everywhere radial in the plane cutting the wire
normally, and its magnitude depends only on the radial distance r.

To calculate the field, imagine a cylindrical Gaussian surface, as shown in
Fig. 1.29(b). Since the field is everywhere radial, flux through the two
ends of the cylindrical Gaussian surface is zero. At the cylindrical part of
the surface, E is normal to the surface at every point, and its magnitude
is constant, since it depends only on r. The surface area of the curved
part is 2πrl, where l is the length of the cylinder.

Flux through the Gaussian surface

= flux through the curved cylindrical part of the surface

= E × 2πrl

The surface includes charge equal to λl. Gauss’s law then gives

$$E \times 2\pi r l = \frac{\lambda l}{\varepsilon_0}$$

$$\text{i.e.,}\quad E = \frac{\lambda}{2\pi\varepsilon_0 r}$$

Vectorially, E at any point is given by

$$\mathbf{E} = \frac{\lambda}{2\pi\varepsilon_0 r}\,\hat{\mathbf{n}}$$ (1.32)

where n̂ is the radial unit vector in the plane normal to the wire passing
through the point. E is directed outward if λ is positive and inward if λ is
negative.





























Figure 1.29 (a) Electric field due to an infinitely long thin straight wire is radial, (b) The Gaussian 
surface for a long thin wire of uniform linear charge density. 


Note that when we write a vector A as a scalar multiplied by a unit vector,
i.e., as A = A â, the scalar A is an algebraic number. It can be negative
or positive. The direction of A will be the same as that of the unit vector
â if A > 0 and opposite to â if A < 0. When we want to restrict to
non-negative values, we use the symbol |A| and call it the modulus of A.
Thus, |A| ≥ 0.

Also note that though only the charge enclosed by the surface (λl) was
included above, the electric field E is due to the charge on the entire wire.
Further, the assumption that the wire is infinitely long is crucial. Without
this assumption, we cannot take E to be normal to the curved part of the
cylindrical Gaussian surface. However, Eq. (1.32) is approximately true
for electric field around the central portions of a long wire, where the end
effects may be ignored.
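How good this approximation is can be seen by summing the Coulomb fields of small elements of a finite straight wire and comparing with Eq. (1.32). The sketch below assumes a point opposite the midpoint of the wire, where by symmetry only the radial component survives; the function name and the numerical values are illustrative assumptions.

```python
import math

EPS0 = 8.854e-12   # permittivity of free space, C^2 N^-1 m^-2

def field_near_wire_centre(lam, length, r, n=20000):
    """Radial field at distance r from the midpoint of a straight wire.

    The wire is divided into n elements, each treated as a point charge lam * dl.
    """
    dl = length / n
    E_r = 0.0
    for i in range(n):
        z = -length / 2 + (i + 0.5) * dl       # position of the element along the wire
        d2 = r * r + z * z                     # squared distance to the field point
        # Radial component of the Coulomb field of this element
        E_r += lam * dl * r / (4 * math.pi * EPS0 * d2**1.5)
    return E_r

lam, r = 1e-8, 0.01                            # illustrative values (C/m, m)
E_infinite = lam / (2 * math.pi * EPS0 * r)    # Eq. (1.32)

ratio_long = field_near_wire_centre(lam, 2.0, r) / E_infinite     # wire 200 times longer than r
ratio_short = field_near_wire_centre(lam, 0.04, r) / E_infinite   # wire only 4 times longer than r
print(ratio_long, ratio_short)
```

For the long wire the ratio is very nearly 1, while for the short wire the finite-length result falls noticeably below the infinite-wire value.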

1.15.2 Field due to a uniformly charged infinite plane sheet 

Let σ be the uniform surface charge density of an infinite plane sheet
(Fig. 1.30). We take the x-axis normal to the given plane. By symmetry,
the electric field will not depend on y and z coordinates and its direction
at every point must be parallel to the x-direction.

We can take the Gaussian surface to be a rectangular parallelepiped of 
cross-sectional area A, as shown. (A cylindrical surface will also do.) As 
seen from the figure, only the two faces 1 and 2 will contribute to the flux; 
electric field lines are parallel to the other faces and they, therefore, do 
not contribute to the total flux. 






Figure 1.30 Gaussian surface for a uniformly charged infinite plane sheet. 

The unit vector normal to surface 1 is in the -x direction while the unit
vector normal to surface 2 is in the +x direction. Therefore, the fluxes
E·ΔS through the two surfaces are equal and add up. Therefore the net flux
through the Gaussian surface is 2EA. The charge enclosed by the closed
surface is σA. Therefore by Gauss’s law,

$$2EA = \frac{\sigma A}{\varepsilon_0}$$

$$\text{or,}\quad E = \frac{\sigma}{2\varepsilon_0}$$

Vectorially,

$$\mathbf{E} = \frac{\sigma}{2\varepsilon_0}\,\hat{\mathbf{n}}$$ (1.33)

where n̂ is a unit vector normal to the plane and going away from it.

E is directed away from the plate if σ is positive and toward the plate if σ
is negative. Note that the above application of Gauss’s law has
brought out an additional fact: E is independent of x also.

For a finite large planar sheet, Eq. (1.33) is approximately true in the 
middle regions of the planar sheet, away from the ends. 
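A direct sum over small patches of a finite sheet shows how this works out in practice. The sketch below is an illustration only, assuming a square sheet and a point directly above its centre; the function name and the numerical values are our own choices.

```python
import math

EPS0 = 8.854e-12   # permittivity of free space, C^2 N^-1 m^-2

def field_above_square_sheet(sigma, side, z, n=200):
    """Normal field at height z above the centre of a uniformly charged square sheet.

    The sheet is divided into n x n patches, each treated as a point charge.
    """
    h = side / n
    centres = [-side / 2 + (i + 0.5) * h for i in range(n)]
    E_z = 0.0
    for x in centres:
        for y in centres:
            d2 = x * x + y * y + z * z
            # Component normal to the sheet of the Coulomb field of one patch
            E_z += sigma * h * h * z / (4 * math.pi * EPS0 * d2**1.5)
    return E_z

sigma, side = 1e-6, 2.0                 # illustrative values (C/m^2, m)
E_infinite = sigma / (2 * EPS0)         # Eq. (1.33)

ratio_near = field_above_square_sheet(sigma, side, 0.05) / E_infinite
ratio_far = field_above_square_sheet(sigma, side, 0.5) / E_infinite
print(ratio_near, ratio_far)
```

Close to the middle of the sheet the field is within a few per cent of σ/2ε₀; farther away, where the edges matter, it is considerably smaller.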

1.15.3 Field due to a uniformly charged thin spherical shell 





















Let σ be the uniform surface charge density of a thin spherical shell of
radius R (Fig. 1.31). The situation has obvious spherical symmetry. The
field at any point P, outside or inside, can depend only on r (the radial
distance from the centre of the shell to the point) and must be radial (i.e.,
along the radius vector).
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Figure 1.31 Gaussian surfaces for a point with (a) r> R, (b) r< R. 


(i) Field outside the shell: Consider a point P outside the shell with
radius vector r. To calculate E at P, we take the Gaussian surface to be a
sphere of radius r and with centre O, passing through P. All points on this
sphere are equivalent relative to the given charged configuration. (That is
what we mean by spherical symmetry.) The electric field at each point of
the Gaussian surface, therefore, has the same magnitude E and is along
the radius vector at each point. Thus, E and ΔS at every point are parallel
and the flux through each element is E ΔS. Summing over all ΔS, the flux
through the Gaussian surface is E × 4πr². The charge enclosed is
σ × 4πR². By Gauss’s law

$$E \times 4\pi r^2 = \frac{\sigma}{\varepsilon_0}\,4\pi R^2$$

$$\text{or,}\quad E = \frac{\sigma R^2}{\varepsilon_0 r^2} = \frac{q}{4\pi\varepsilon_0 r^2}$$

where q = 4πR²σ is the total charge on the spherical shell.
Vectorially,

$$\mathbf{E} = \frac{q}{4\pi\varepsilon_0 r^2}\,\hat{\mathbf{r}}$$ (1.34)

The electric field is directed outward if q > 0 and inward if
q < 0. This, however, is exactly the field produced by a charge
q placed at the centre O. Thus for points outside the shell, the field due to
a uniformly charged shell is as if the entire charge of the shell is
concentrated at its centre.

(ii) Field inside the shell: In Fig. 1.31(b), the point P is inside the shell. 
The Gaussian surface is again a sphere through P centred at O. The flux 
through the Gaussian surface, calculated as before, is $E \times 4\pi r^2$.
However, in this case, the Gaussian surface encloses no charge. Gauss’s
law then gives

$E \times 4\pi r^2 = 0$, i.e., $E = 0$ (r < R)

that is, the field due to a uniformly charged thin shell is zero at all points
inside the shell*. This important result is a direct consequence of Gauss’s
law which follows from Coulomb’s law. The experimental verification of
this result confirms the $1/r^2$ dependence in Coulomb’s law.
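
The two results for the shell can be collected in a small function. This is a minimal sketch: the function names, the sample values and the convention that r = R counts as outside the shell are assumptions made for illustration.

```python
import math

EPS0 = 8.854e-12  # permittivity of free space, C^2 N^-1 m^-2


def shell_charge(sigma, R):
    """Total charge q = 4 pi R^2 sigma on a shell of radius R."""
    return 4 * math.pi * R**2 * sigma


def point_charge_field(q, r):
    """Radial field of a point charge q at distance r."""
    return q / (4 * math.pi * EPS0 * r**2)


def shell_field(q, R, r):
    """Radial field of a uniformly charged thin shell at distance r from O.

    Positive values point outward, negative values inward.
    """
    if r < R:
        return 0.0                      # inside: no charge is enclosed
    return point_charge_field(q, r)     # outside: Eq. (1.34)


q = shell_charge(sigma=1e-6, R=0.1)     # hypothetical shell
print("inside :", shell_field(q, 0.1, 0.05))
print("outside:", shell_field(q, 0.1, 0.2), "point charge:", point_charge_field(q, 0.2))
```

Outside the shell the function simply returns the point-charge value, which is the content of the statement that the shell acts as if all its charge sat at the centre.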

Example 1.13 An early model for an atom considered it to have a 
positively charged point nucleus of charge Ze, surrounded by a 
uniform density of negative charge up to a radius R. The atom as a 
whole is neutral. For this model, what is the electric field at a distance 
r from the nucleus? 



Solution The charge distribution for this model of the atom is as
shown in Fig. 1.32. The total negative charge in the uniform spherical
charge distribution of radius R must be $-Ze$, since the atom (nucleus
of charge Ze + negative charge) is neutral. This immediately gives us
the negative charge density $\rho$, since we must have

$\dfrac{4\pi R^3}{3}\,\rho = 0 - Ze$, or $\rho = -\dfrac{3Ze}{4\pi R^3}$




To find the electric field E(r) at a point P which is a distance r away
from the nucleus, we use Gauss’s law. Because of the spherical
symmetry of the charge distribution, the magnitude of the electric field
E(r) depends only on the radial distance, no matter what the direction
of r. Its direction is along (or opposite to) the radius vector r from the
origin to the point P. The obvious Gaussian surface is a spherical
surface centred at the nucleus. We consider two situations, namely,
r < R and r > R.

(i) r < R : The electric flux $\phi$ enclosed by the spherical surface is

$\phi = E(r) \times 4\pi r^2$

where E(r) is the magnitude of the electric field at r. This is because
the field at any point on the spherical Gaussian surface has the same
direction as the normal to the surface there, and has the same
magnitude at all points on the surface.

The charge q enclosed by the Gaussian surface is the positive
nuclear charge and the negative charge within the sphere of radius r,

i.e., $q = Ze + \dfrac{4\pi r^3}{3}\,\rho$

Substituting for the charge density $\rho$ obtained earlier, we have

$q = Ze - Ze\,\dfrac{r^3}{R^3}$

Gauss’s law then gives,

$E(r) = \dfrac{Ze}{4\pi\varepsilon_0}\left(\dfrac{1}{r^2} - \dfrac{r}{R^3}\right)$; r < R


The electric field is directed radially outward. 

(ii) r > R: In this case, the total charge enclosed by the Gaussian
spherical surface is zero since the atom is neutral. Thus, from
Gauss’s law,

$E(r) \times 4\pi r^2 = 0$ or $E(r) = 0$; r > R

At r = R, both cases give the same result: E = 0.
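
The result of Example 1.13 can be evaluated numerically to see how the field falls to zero at the edge of the atom. This is a minimal sketch; the function name and the sample values (Z = 1, R = 10^-10 m) are assumptions chosen for illustration.

```python
import math

EPS0 = 8.854e-12       # permittivity of free space, C^2 N^-1 m^-2
E_CHARGE = 1.602e-19   # elementary charge e, C


def atom_model_field(Z, R, r):
    """Radial field E(r) for the model of Example 1.13 (outward is positive)."""
    if r <= 0:
        raise ValueError("r must be positive; the field is undefined at the nucleus")
    if r >= R:
        return 0.0     # the enclosed charge is zero for r >= R
    prefactor = Z * E_CHARGE / (4 * math.pi * EPS0)
    return prefactor * (1 / r**2 - r / R**3)


Z, R = 1, 1e-10        # hypothetical values
for fraction in (0.25, 0.5, 0.9, 1.0, 2.0):
    r = fraction * R
    print(f"r = {fraction} R: E = {atom_model_field(Z, R, r):.3e} N/C")
```

Inside the atom the field is positive (radially outward) and decreases steadily to zero at r = R, in agreement with the two cases worked out above.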


On symmetry operations 

In Physics, we often encounter systems with various symmetries.
Consideration of these symmetries helps one arrive at results much
faster than otherwise by a straightforward calculation. Consider, for
example, an infinite uniform sheet of charge (surface charge
density $\sigma$) along the y-z plane. This system is unchanged if (a)
translated parallel to the y-z plane in any direction, (b) rotated about
the x-axis through any angle. As the system is unchanged under
such symmetry operations, so must its properties be. In particular, in
this example, the electric field E must be unchanged.

Translation symmetry along the y-axis shows that the electric field
must be the same at a point $(0, y_1, 0)$ as at $(0, y_2, 0)$. Similarly
translational symmetry along the z-axis shows that the electric field at
two points $(0, 0, z_1)$ and $(0, 0, z_2)$ must be the same. By using rotation
symmetry around the x-axis, we can conclude that E must be
perpendicular to the y-z plane, that is, it must be parallel to the
x-direction.

Try to think of a symmetry now which will tell you that the magnitude
of the electric field is a constant, independent of the x-coordinate. It
thus turns out that the magnitude of the electric field due to a
uniformly charged infinite conducting sheet is the same at all points in
space. The direction, however, is opposite on the two sides of the
sheet.

Compare this with the effort needed to arrive at this result by a direct 
calculation using Coulomb’s law. 


Summary 

1. Electric and magnetic forces determine the properties of atoms, 
molecules and bulk matter. 

2. From simple experiments on frictional electricity, one can infer that 
there are two types of charges in nature; and that like charges repel 
and unlike charges attract. By convention, the charge on a glass rod 
rubbed with silk is positive; that on a plastic rod rubbed with fur is 
then negative. 

3. Conductors allow movement of electric charge through them, 
insulators do not. In metals, the mobile charges are electrons; in 
electrolytes both positive and negative ions are mobile. 

4. Electric charge has three basic properties: quantisation, additivity 
and conservation. 

Quantisation of electric charge means that total charge (q) of a body 
is always an integral multiple of a basic quantum of charge (e) i.e., 
q = n e, where n = 0, ±1, ±2, ±3, .... Proton and electron have charges 
+e, -e, respectively. For macroscopic charges for which n is a very 
large number, quantisation of charge can be ignored. 






Additivity of electric charges means that the total charge of a system 
is the algebraic sum (i.e., the sum taking into account proper signs) of 
all individual charges in the system. 

* Compare this with a uniform mass shell discussed in Section 8.5 of 
Class XI Textbook of Physics. 

Conservation of electric charges means that the total charge of an 
isolated system remains unchanged with time. This means that when 
bodies are charged through friction, there is a transfer of electric 
charge from one body to another, but no creation or destruction of 
charge. 


5. Coulomb’s Law: The mutual electrostatic force between two point
charges $q_1$ and $q_2$ is proportional to the product $q_1 q_2$ and inversely
proportional to the square of the distance $r_{21}$ separating them.
Mathematically,

$\mathbf{F}_{21}$ = force on $q_2$ due to $q_1$ = $\dfrac{k\,(q_1 q_2)}{r_{21}^2}\,\hat{\mathbf{r}}_{21}$

where $\hat{\mathbf{r}}_{21}$ is a unit vector in the direction from $q_1$ to $q_2$ and
$k = \dfrac{1}{4\pi\varepsilon_0}$ is the constant of proportionality.

In SI units, the unit of charge is coulomb. The experimental value of
the constant $\varepsilon_0$ is

$\varepsilon_0 = 8.854 \times 10^{-12}\ \mathrm{C^2\,N^{-1}\,m^{-2}}$

The approximate value of k is

$k = 9 \times 10^{9}\ \mathrm{N\,m^2\,C^{-2}}$

6. The ratio of electric force and gravitational force between a proton
and an electron is

$\dfrac{k e^2}{G\,m_e m_p} \cong 2.4 \times 10^{39}$

7. Superposition Principle: The principle is based on the property that
the forces with which two charges attract or repel each other are not
affected by the presence of a third (or more) additional charge(s). For
an assembly of charges $q_1$, $q_2$, $q_3$, ..., the force on any charge, say
$q_1$, is the vector sum of the force on $q_1$ due to $q_2$, the force on $q_1$ due
to $q_3$, and so on. For each pair, the force is given by Coulomb’s
law for two charges stated earlier.

8. The electric field E at a point due to a charge configuration is the
force on a small positive test charge q placed at the point divided by
the magnitude of the charge. Electric field due to a point charge q has
a magnitude $|q|/4\pi\varepsilon_0 r^2$; it is radially outwards from q, if q is positive,
and radially inwards if q is negative. Like Coulomb force, electric field
also satisfies superposition principle.

9. An electric field line is a curve drawn in such a way that the 
tangent at each point on the curve gives the direction of electric field 
at that point. The relative closeness of field lines indicates the relative 
strength of electric field at different points; they crowd near each 
other in regions of strong electric field and are far apart where the 
electric field is weak. In regions of constant electric field, the field 
lines are uniformly spaced parallel straight lines. 

10. Some of the important properties of field lines are: (i) Field lines 
are continuous curves without any breaks, (ii) Two field lines cannot 
cross each other, (iii) Electrostatic field lines start at positive charges 
and end at negative charges —they cannot form closed loops. 




11. An electric dipole is a pair of equal and opposite charges q and -q 
separated by some distance 2a. Its dipole moment vector p has 
magnitude 2qa and is in the direction of the dipole axis from -q to q. 

12. Field of an electric dipole in its equatorial plane (i.e., the plane
perpendicular to its axis and passing through its centre) at a distance
r from the centre:

$\mathbf{E} = \dfrac{-\mathbf{p}}{4\pi\varepsilon_0}\,\dfrac{1}{(a^2 + r^2)^{3/2}} \cong \dfrac{-\mathbf{p}}{4\pi\varepsilon_0 r^3}$, for r >> a

Dipole electric field on the axis at a distance r from the centre:

$\mathbf{E} = \dfrac{2\mathbf{p}\,r}{4\pi\varepsilon_0 (r^2 - a^2)^2} \cong \dfrac{2\mathbf{p}}{4\pi\varepsilon_0 r^3}$, for r >> a

The $1/r^3$ dependence of dipole electric fields should be noted in
contrast to the $1/r^2$ dependence of electric field due to a point charge.

13. In a uniform electric field E, a dipole experiences a torque
$\boldsymbol{\tau}$ given by

$\boldsymbol{\tau} = \mathbf{p} \times \mathbf{E}$

but experiences no net force.

14. The flux $\Delta\phi$ of electric field E through a small area element
$\Delta\mathbf{S}$ is given by

$\Delta\phi = \mathbf{E} \cdot \Delta\mathbf{S}$

The vector area element $\Delta\mathbf{S}$ is

$\Delta\mathbf{S} = \Delta S\,\hat{\mathbf{n}}$

where $\Delta S$ is the magnitude of the area element and $\hat{\mathbf{n}}$ is normal to
the area element, which can be considered planar for sufficiently
small $\Delta S$. For an area element of a closed surface, $\hat{\mathbf{n}}$ is taken to be
the direction of outward normal, by convention.

15. Gauss’s law: The flux of electric field through any closed surface
S is $1/\varepsilon_0$ times the total charge enclosed by S. The law is especially
useful in determining electric field E, when the source distribution has
simple symmetry:

(i) Thin infinitely long straight wire of uniform linear charge density $\lambda$

$\mathbf{E} = \dfrac{\lambda}{2\pi\varepsilon_0 r}\,\hat{\mathbf{n}}$

where r is the perpendicular distance of the point from the wire and
$\hat{\mathbf{n}}$ is the radial unit vector in the plane normal to the wire passing
through the point.

(ii) Infinite thin plane sheet of uniform surface charge density $\sigma$

$\mathbf{E} = \dfrac{\sigma}{2\varepsilon_0}\,\hat{\mathbf{n}}$

where $\hat{\mathbf{n}}$ is a unit vector normal to the plane, outward on either side.

(iii) Thin spherical shell of uniform surface charge density $\sigma$

$\mathbf{E} = \dfrac{q}{4\pi\varepsilon_0 r^2}\,\hat{\mathbf{r}}$ (r > R)

$\mathbf{E} = 0$ (r < R)

where r is the distance of the point from the centre of the shell and R
the radius of the shell. q is the total charge of the shell: $q = 4\pi R^2 \sigma$.

The electric field outside the shell is as though the total charge is
concentrated at the centre. The same result is true for a solid sphere
of uniform volume charge density. The field is zero at all points inside
the shell.
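
The ratio in item 6 of this summary can be checked with a few lines of arithmetic. This is a minimal sketch; the constants below are rounded values taken from standard tables, not from this chapter, so the last digits of the result depend on the rounding.

```python
# Rounded values of physical constants (assumed, from standard tables)
K = 8.988e9           # 1/(4 pi eps0), N m^2 C^-2
E_CHARGE = 1.602e-19  # elementary charge, C
G = 6.674e-11         # gravitational constant, N m^2 kg^-2
M_E = 9.109e-31       # electron mass, kg
M_P = 1.673e-27       # proton mass, kg

# Both forces vary as 1/r^2, so the separation r cancels in the ratio.
ratio = K * E_CHARGE**2 / (G * M_E * M_P)
print(f"F_electric / F_gravitational = {ratio:.2e}")
```

With these inputs the ratio comes out a little above 2 × 10^39, the same order of magnitude as the value quoted in item 6. Because the separation cancels, the comparison holds at every distance between the proton and the electron.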



Points to Ponder 

1. You might wonder why the protons, all carrying positive charges,
are compactly residing inside the nucleus. Why do they not fly away?
You will learn that there is a third kind of a fundamental force, called
the strong force which holds them together. The range of distance
where this force is effective is, however, very small, ~$10^{-14}$ m. This is
precisely the size of the nucleus. Also the electrons are not allowed
to sit on top of the protons, i.e. inside the nucleus, due to the laws of
quantum mechanics. This gives the atoms their structure as they
exist in nature.

2. Coulomb force and gravitational force follow the same
inverse-square law. But gravitational force has only one sign (always
attractive), while Coulomb force can be of both signs (attractive and
repulsive), allowing possibility of cancellation of electric forces. This is
how gravity, despite being a much weaker force, can be a dominating
and more pervasive force in nature.

3. The constant of proportionality k in Coulomb’s law is a matter of
choice if the unit of charge is to be defined using Coulomb’s law. In SI
units, however, what is defined is the unit of current (A) via its
magnetic effect (Ampere’s law) and the unit of charge (coulomb) is
simply defined by (1 C = 1 A s). In this case, the value of k is no
longer arbitrary; it is approximately $9 \times 10^{9}$ N m$^2$ C$^{-2}$.

4. The rather large value of k, i.e., the large size of the unit of charge 
(1C) from the point of view of electric effects arises because (as 
mentioned in point 3 already) the unit of charge is defined in terms of 
magnetic forces (forces on current-carrying wires) which are 
generally much weaker than the electric forces. Thus while 1 ampere 
is a unit of reasonable size for magnetic effects, 1 C = 1 A s, is too 
big a unit for electric effects. 

5. The additive property of charge is not an ‘obvious’ property. It is 
related to the fact that electric charge has no direction associated 
with it; charge is a scalar. 

6. Charge is not only a scalar (or invariant) under rotation; it is also 
invariant for frames of reference in relative motion. This is not always 
true for every scalar. For example, kinetic energy is a scalar under 
rotation, but is not invariant for frames of reference in relative 
motion. 

7. Conservation of total charge of an isolated system is a property
independent of the scalar nature of charge noted in point 6.
Conservation refers to invariance in time in a given frame of
reference. A quantity may be scalar but not conserved (like kinetic
energy in an inelastic collision). On the other hand, one can have
conserved vector quantity (e.g., angular momentum of an isolated
system).

8. Quantisation of electric charge is a basic (unexplained) law of 
nature; interestingly, there is no analogous law on quantisation of 
mass. 

9. Superposition principle should not be regarded as ‘obvious’, or 
equated with the law of addition of vectors. It says two things: force 
on one charge due to another charge is unaffected by the presence 
of other charges, and there are no additional three-body, four-body, 
etc., forces which arise only when there are more than two charges. 

10. The electric field due to a discrete charge configuration is not 
defined at the locations of the discrete charges. For continuous 
volume charge distribution, it is defined at any point in the 
distribution. For a surface charge distribution, electric field is 
discontinuous across the surface. 

11. The electric field due to a charge configuration with total charge
zero is not zero; but for distances large compared to the size of
the configuration, its field falls off faster than $1/r^2$, typical of field due
to a single charge. An electric dipole is the simplest example of this
fact.
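
Point 11 can be illustrated by superposing the fields of the two charges of a dipole at points on its axis. This is a minimal sketch; the function names and sample values are assumptions made for illustration, and the far-field expression is the 2p/(4πε0 r³) form for points on the dipole axis.

```python
K = 8.988e9  # 1/(4 pi eps0), N m^2 C^-2 (rounded)


def point_charge_field(q, r):
    """Field magnitude of a single charge q at distance r."""
    return K * q / r**2


def dipole_axis_field(q, a, r):
    """On-axis field of charges +q at +a and -q at -a, by superposition (r > a)."""
    return K * q * (1 / (r - a)**2 - 1 / (r + a)**2)


def dipole_axis_far_field(q, a, r):
    """Large-distance form 2p / (4 pi eps0 r^3) with p = 2 q a."""
    p = 2 * q * a
    return 2 * K * p / r**3


q, a = 1e-9, 0.01   # hypothetical charge (C) and half-separation (m)
for r in (1.0, 2.0, 4.0):
    print(r, point_charge_field(q, r), dipole_axis_field(q, a, r))
```

Doubling r reduces the single-charge field by a factor of 4 but the dipole field by a factor of about 8, which is the faster fall-off described in point 11.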


EXERCISES 


1.1 What is the force between two small charged spheres having
charges of $2 \times 10^{-7}$ C and $3 \times 10^{-7}$ C placed 30 cm apart in air?

1.2 The electrostatic force on a small sphere of charge 0.4 µC due to
another small sphere of charge -0.8 µC in air is 0.2 N. (a) What is the
distance between the two spheres? (b) What is the force on the
second sphere due to the first?

1.3 Check that the ratio $k e^2/(G\,m_e m_p)$ is dimensionless. Look up a
Table of Physical Constants and determine the value of this ratio.
What does the ratio signify?

1.4 (a) Explain the meaning of the statement ‘electric charge of a 
body is quantised’. 

(b) Why can one ignore quantisation of electric charge when dealing 
with macroscopic i.e., large scale charges? 

1.5 When a glass rod is rubbed with a silk cloth, charges appear on 
both. A similar phenomenon is observed with many other pairs of 
bodies. Explain how this observation is consistent with the law of 
conservation of charge. 

1.6 Four point charges $q_A$ = 2 µC, $q_B$ = -5 µC, $q_C$ = 2 µC, and
$q_D$ = -5 µC are located at the corners of a square ABCD of side 10 cm.
What is the force on a charge of 1 µC placed at the centre of the square?

1.7 (a) An electrostatic field line is a continuous curve. That is, a field 
line cannot have sudden breaks. Why not? 

(b) Explain why two field lines never cross each other at any point? 

1.8 Two point charges $q_A$ = 3 µC and $q_B$ = -3 µC are located 20 cm
apart in vacuum.

(a) What is the electric field at the midpoint O of the line AB joining
the two charges?

(b) If a negative test charge of magnitude $1.5 \times 10^{-9}$ C is placed at
this point, what is the force experienced by the test charge?

1.9 A system has two charges $q_A = 2.5 \times 10^{-7}$ C and
$q_B = -2.5 \times 10^{-7}$ C located at points A: (0, 0, -15 cm) and
B: (0, 0, +15 cm), respectively. What are the total charge and electric
dipole moment of the system?

1.10 An electric dipole with dipole moment $4 \times 10^{-9}$ C m is aligned at
30° with the direction of a uniform electric field of magnitude
$5 \times 10^{4}$ N C$^{-1}$. Calculate the magnitude of the torque acting on the dipole.

1.11 A polythene piece rubbed with wool is found to have a negative
charge of $3 \times 10^{-7}$ C.

(a) Estimate the number of electrons transferred (from which to 
which?) 

(b) Is there a transfer of mass from wool to polythene? 

1.12 (a) Two insulated charged copper spheres A and B have their
centres separated by a distance of 50 cm. What is the mutual force of
electrostatic repulsion if the charge on each is $6.5 \times 10^{-7}$ C? The radii
of A and B are negligible compared to the distance of separation.

(b) What is the force of repulsion if each sphere is charged double
the above amount, and the distance between them is halved?

1.13 Suppose the spheres A and B in Exercise 1.12 have identical
sizes. A third sphere of the same size but uncharged is brought in
contact with the first, then brought in contact with the second, and
finally removed from both. What is the new force of repulsion
between A and B?

1.14 Figure 1.33 shows tracks of three charged particles in a uniform 
electrostatic field. Give the signs of the three charges. Which particle 
has the highest charge to mass ratio? 



Figure 1.33 


1.15 Consider a uniform electric field $\mathbf{E} = 3 \times 10^{3}\,\hat{\mathbf{i}}$ N/C. (a) What is
the flux of this field through a square of 10 cm on a side whose plane
is parallel to the yz plane? (b) What is the flux through the same
square if the normal to its plane makes a 60° angle with the x-axis?

1.16 What is the net flux of the uniform electric field of Exercise 1.15 
through a cube of side 20 cm oriented so that its faces are parallel to 
the coordinate planes? 

1.17 Careful measurement of the electric field at the surface of a
black box indicates that the net outward flux through the surface of
the box is $8.0 \times 10^{3}$ N m$^2$/C. (a) What is the net charge inside the
box? (b) If the net outward flux through the surface of the box were
zero, could you conclude that there were no charges inside the box?
Why or why not?

1.18 A point charge +10 µC is a distance 5 cm directly above the
centre of a square of side 10 cm, as shown in Fig. 1.34. What is the
magnitude of the electric flux through the square? (Hint: Think of the
square as one face of a cube with edge 10 cm.)















Figure 1.34 (the charge is 5 cm above the centre of the square; the side of the square is 10 cm)


1.19 A point charge of 2.0 µC is at the centre of a cubic Gaussian
surface 9.0 cm on edge. What is the net electric flux through the
surface?

1.20 A point charge causes an electric flux of $-1.0 \times 10^{3}$ N m$^2$/C to
pass through a spherical Gaussian surface of 10.0 cm radius centred
on the charge. (a) If the radius of the Gaussian surface were doubled,
how much flux would pass through the surface? (b) What is the value
of the point charge?

1.21 A conducting sphere of radius 10 cm has an unknown charge. If
the electric field 20 cm from the centre of the sphere is $1.5 \times 10^{3}$ N/C
and points radially inward, what is the net charge on the sphere?

1.22 A uniformly charged conducting sphere of 2.4 m diameter has a
surface charge density of 80.0 µC/m$^2$. (a) Find the charge on the
sphere. (b) What is the total electric flux leaving the surface of the
sphere?











1.23 An infinite line charge produces a field of $9 \times 10^{4}$ N/C at a
distance of 2 cm. Calculate the linear charge density.

1.24 Two large, thin metal plates are parallel and close to each other.
On their inner faces, the plates have surface charge densities of
opposite signs and of magnitude $17.0 \times 10^{-22}$ C/m$^2$. What is E: (a) in
the outer region of the first plate, (b) in the outer region of the second
plate, and (c) between the plates?


ADDITIONAL EXERCISES 

1.25 An oil drop of 12 excess electrons is held stationary under a
constant electric field of $2.55 \times 10^{4}$ N C$^{-1}$ (Millikan’s oil drop
experiment). The density of the oil is 1.26 g cm$^{-3}$. Estimate the
radius of the drop.

(g = 9.81 m s$^{-2}$; e = $1.60 \times 10^{-19}$ C).

1.26 Which among the curves shown in Fig. 1.35 cannot possibly 
represent electrostatic field lines? 



Figure 1.35

1.27 In a certain region of space, electric field is along the z-direction
throughout. The magnitude of electric field is, however, not constant
but increases uniformly along the positive z-direction, at the rate of
$10^{5}$ N C$^{-1}$ per metre. What are the force and torque experienced by a
system having a total dipole moment equal to $10^{-7}$ C m in the
negative z-direction?

1.28 (a) A conductor A with a cavity as shown in Fig. 1.36(a) is given
a charge Q. Show that the entire charge must appear on the outer
surface of the conductor. (b) Another conductor B with charge q is
inserted into the cavity keeping B insulated from A. Show that the
total charge on the outside surface of A is Q + q [Fig. 1.36(b)]. (c) A
sensitive instrument is to be shielded from the strong electrostatic
fields in its environment. Suggest a possible way.

Figure 1.36 (a), (b)

1.29 A hollow charged conductor has a tiny hole cut into its surface.
Show that the electric field in the hole is $(\sigma/2\varepsilon_0)\,\hat{\mathbf{n}}$, where $\hat{\mathbf{n}}$ is the
unit vector in the outward normal direction, and $\sigma$ is the surface
charge density near the hole.

1.30 Obtain the formula for the electric field due to a long thin wire of
uniform linear charge density $\lambda$ without using Gauss’s law. [Hint: Use
Coulomb’s law directly and evaluate the necessary integral.]

1.31 It is now believed that protons and neutrons (which constitute
nuclei of ordinary matter) are themselves built out of more elementary
units called quarks. A proton and a neutron consist of three quarks
each. Two types of quarks, the so called ‘up’ quark (denoted by u) of
charge +(2/3) e, and the ‘down’ quark (denoted by d) of charge
(-1/3) e, together with electrons build up ordinary matter. (Quarks of
other types have also been found which give rise to different unusual
varieties of matter.) Suggest a possible quark composition of a proton
and neutron.

1.32 (a) Consider an arbitrary electrostatic field configuration. A small
test charge is placed at a null point (i.e., where E = 0) of the
configuration. Show that the equilibrium of the test charge is
necessarily unstable.

(b) Verify this result for the simple configuration of two charges of the 
same magnitude and sign placed a certain distance apart. 

1.33 A particle of mass m and charge (-q) enters the region between
the two charged plates initially moving along x-axis with speed $v_x$
(like particle 1 in Fig. 1.33). The length of plate is L and a uniform
electric field E is maintained between the plates. Show that the
vertical deflection of the particle at the far edge of the plate is
$qEL^2/(2m v_x^2)$.

Compare this motion with motion of a projectile in gravitational field
discussed in Section 4.10 of Class XI Textbook of Physics.

1.34 Suppose that the particle in Exercise 1.33 is an electron
projected with velocity $v_x = 2.0 \times 10^{6}$ m s$^{-1}$. If E between the plates
separated by 0.5 cm is $9.1 \times 10^{2}$ N/C, where will the electron strike
the upper plate? (|e| = $1.6 \times 10^{-19}$ C, $m_e = 9.1 \times 10^{-31}$ kg)

Chapter Two


Electrostatic Potential and Capacitance 


2.1 Introduction 

In Chapters 6 and 8 (Class XI), the notion of potential energy was 
introduced. When an external force does work in taking a body from a 
point to another against a force like spring force or gravitational force, 
that work gets stored as potential energy of the body. When the external 
force is removed, the body moves, gaining kinetic energy and losing an 
equal amount of potential energy. The sum of kinetic and potential 
energies is thus conserved. Forces of this kind are called conservative 
forces. Spring force and gravitational force are examples of conservative 
forces. 

Coulomb force between two (stationary) charges is also a conservative 
force. This is not surprising, since both have inverse-square dependence 
on distance and differ mainly in the proportionality constants - the 
masses in the gravitational law are replaced by charges in Coulomb’s 
law. Thus, like the potential energy of a mass in a gravitational field, we 
can define electrostatic potential energy of a charge in an electrostatic 
field. 

Consider an electrostatic field E due to some charge configuration. First, 
for simplicity, consider the field E due to a charge Q placed at the origin. 
Now, imagine that we bring a test charge q from a point R to a point P 
against the repulsive force on it due to the charge Q. With reference to 
Fig. 2.1, this will happen if Q and q are both positive or both negative. For 
definiteness, let us take Q, q > 0. 




Two remarks may be made here. First, we assume that the test charge q
is so small that it does not disturb the original configuration, namely the
charge Q at the origin (or else, we keep Q fixed at the origin by some
unspecified force). Second, in bringing the charge q from R to P, we
apply an external force $\mathbf{F}_{ext}$ just enough to counter the repulsive electric
force $\mathbf{F}_E$ (i.e., $\mathbf{F}_{ext} = -\mathbf{F}_E$). This means there is no net force on or
acceleration of the charge q when it is brought from R to P, i.e., it is
brought with infinitesimally slow constant speed. In this situation, work
done by the external force is the negative of the work done by the electric
force, and gets fully stored in the form of potential energy of the charge q.
If the external force is removed on reaching P, the electric force will take
the charge away from Q - the stored energy (potential energy) at P is
used to provide kinetic energy to the charge q in such a way that the sum
of the kinetic and potential energies is conserved.



Figure 2.1 A test charge q (> 0) is moved from the point R to the point P against the repulsive 
force on it by the charge Q (> 0) placed at the origin. 

Thus, work done by external forces in moving a charge q from R to P is

$W_{RP} = \int_R^P \mathbf{F}_{ext} \cdot d\mathbf{r} = -\int_R^P \mathbf{F}_E \cdot d\mathbf{r}$ (2.1)


This work done is against electrostatic repulsive force and gets stored as
potential energy.

At every point in electric field, a particle with charge q possesses a
certain electrostatic potential energy; this work done increases its
potential energy by an amount equal to potential energy difference
between points R and P.

Thus, potential energy difference

$\Delta U = U_P - U_R = W_{RP}$ (2.2)

(Note here that this displacement is in an opposite sense to the electric
force and hence work done by electric field is negative, i.e., $-W_{RP}$.)

Therefore, we can define electric potential energy difference between two 
points as the work required to be done by an external force in moving 
(without accelerating) charge q from one point to another for electric field 
of any arbitrary charge configuration. 

Two important comments may be made at this stage: 

(i) The right side of Eq. (2.2) depends only on the initial and final 
positions of the charge. It means that the work done by an electrostatic 
field in moving a charge from one point to another depends only on the 
initial and the final points and is independent of the path taken to go from 
one point to the other. This is the fundamental characteristic of a 
conservative force. The concept of the potential energy would not be 
meaningful if the work depended on the path. The path-independence of 
work done by an electrostatic field can be proved using the Coulomb’s 
law. We omit this proof here. 

(ii) Equation (2.2) defines potential energy difference in terms of the
physically meaningful quantity work. Clearly, potential energy so defined
is undetermined to within an additive constant. What this means is that
the actual value of potential energy is not physically significant; it is only
the difference of potential energy that is significant. We can always add
an arbitrary constant $\alpha$ to potential energy at every point, since this will
not change the potential energy difference:

$(U_P + \alpha) - (U_R + \alpha) = U_P - U_R$

Put differently, there is a freedom in choosing the point where potential
energy is zero. A convenient choice is to have electrostatic potential
energy zero at infinity. With this choice, if we take the point R at infinity,
we get from Eq. (2.2)

$W_{\infty P} = U_P - U_\infty = U_P$ (2.3)

Since the point P is arbitrary, Eq. (2.3) provides us with a definition of 
potential energy of a charge q at any point. Potential energy of charge q 
at a point (in the presence of field due to any charge configuration) is the 
work done by the external force (equal and opposite to the electric force) 
in bringing the charge q from infinity to that point. 
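
The path independence noted in comment (i) can be checked numerically for the field of a single point charge. The sketch below sums q E · Δr along two different paths between the same end points; the paths, the sample charges and the function names are assumptions made for illustration, and the comparison value uses the 1/r form of the potential energy for a point charge.

```python
import math

K = 8.988e9  # 1/(4 pi eps0), N m^2 C^-2 (rounded)


def field(Q, x, y):
    """Components of the field of a point charge Q at the origin (xy-plane)."""
    r3 = (x * x + y * y) ** 1.5
    return K * Q * x / r3, K * Q * y / r3


def work_by_field(Q, q, path, steps=20000):
    """Sum q E . dr over small straight segments of path(t), t from 0 to 1."""
    total = 0.0
    x0, y0 = path(0.0)
    for i in range(1, steps + 1):
        x1, y1 = path(i / steps)
        ex, ey = field(Q, (x0 + x1) / 2, (y0 + y1) / 2)  # field at segment midpoint
        total += q * (ex * (x1 - x0) + ey * (y1 - y0))
        x0, y0 = x1, y1
    return total


def straight(t):
    """Straight line from R = (2, 0) to P = (0, 1)."""
    return 2 - 2 * t, t


def curved(t):
    """Quarter of an ellipse joining the same two points."""
    return 2 * math.cos(math.pi * t / 2), math.sin(math.pi * t / 2)


Q, q = 1e-6, 1e-9   # hypothetical source and test charges, C
w_straight = work_by_field(Q, q, straight)
w_curved = work_by_field(Q, q, curved)
expected = K * Q * q * (1 / 2 - 1 / 1)   # K Q q (1/r_R - 1/r_P)
print(w_straight, w_curved, expected)
```

Both paths give the same (negative) work done by the field, so the external force does an equal positive amount of work, which is the potential energy difference of Eq. (2.2).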



Count Alessandro Volta (1745 - 1827) Italian physicist, professor at
Pavia. Volta established that the animal electricity observed by Luigi
Galvani, 1737-1798, in experiments with frog muscle tissue placed in
contact with dissimilar metals, was not due to any exceptional
property of animal tissues but was also generated whenever any wet
body was sandwiched between dissimilar metals. This led him to
develop the first voltaic pile, or battery, consisting of a large stack of
moist disks of cardboard (electrolyte) sandwiched between disks of
metal (electrodes).


2.2 Electrostatic Potential 


Consider any general static charge configuration. We define potential 
energy of a test charge q in terms of the work done on the charge q. This 
work is obviously proportional to q, since the force at any point is qE, 
where E is the electric field at that point due to the given charge 
configuration. It is, therefore, convenient to divide the work by the amount 
of charge q, so that the resulting quantity is independent of q. In other 
words, work done per unit test charge is characteristic of the electric field 
associated with the charge configuration. This leads to the idea of 
electrostatic potential V due to a given charge configuration. From Eq. 
(2.1), we get: 

Work done by external force in bringing a unit positive charge from point
R to P

$= V_P - V_R \left(= \dfrac{U_P - U_R}{q}\right)$ (2.4)

where $V_P$ and $V_R$ are the electrostatic potentials at P and R, respectively.
Note, as before, that it is not the actual value of potential but the potential 
difference that is physically significant. If, as before, we choose the 
potential to be zero at infinity, Eq. (2.4) implies: 

Work done by an external force in bringing a unit positive charge from 
infinity to a point = electrostatic potential (V) at that point. 

In other words, the electrostatic potential (V) at any point in a region with 
electrostatic field is the work done in bringing a unit positive charge 
(without acceleration) from infinity to that point. 






The qualifying remarks made earlier regarding potential energy also
apply to the definition of potential. To obtain the work done per unit test
charge, we should take an infinitesimal test charge $\delta q$, obtain the work
done $\delta W$ in bringing it from infinity to the point and determine the ratio
$\delta W/\delta q$. Also, the external force at every point of the path is to be equal
and opposite to the electrostatic force on the test charge at that point.



Figure 2.2 Work done on a test charge q by the electrostatic field due to any given charge
configuration is independent of the path, and depends only on its initial and final positions. 

2.3 Potential due to a Point Charge 

Consider a point charge Q at the origin (Fig. 2.3). For definiteness, take 
Q to be positive. We wish to determine the potential at any point P with 
position vector r from the origin. For that we must calculate the work done 
in bringing a unit positive test charge from infinity to the point P. For Q > 
0, the work done against the repulsive force on the test charge is 
positive. Since work done is independent of the path, we choose a 
convenient path - along the radial direction from infinity to the point P. 






Figure 2.3 Work done in bringing a unit positive test charge from infinity to the point P, against the 
repulsive force of charge Q (Q> 0), is the potential at P due to the charge Q. 


At some intermediate point P′ on the path, the electrostatic force on a unit
positive charge is

$$ \frac{Q \times 1}{4\pi\varepsilon_0 r'^2}\,\hat{\mathbf{r}}' \tag{2.5} $$

where $\hat{\mathbf{r}}'$ is the unit vector along OP′. Work done against this force from r′
to r′ + Δr′ is

$$ \Delta W = -\frac{Q}{4\pi\varepsilon_0 r'^2}\,\Delta r' \tag{2.6} $$


The negative sign appears because for Δr′ < 0, ΔW is positive. Total work
done (W) by the external force is obtained by integrating Eq. (2.6) from r′
= ∞ to r′ = r,

$$ W = -\int_{\infty}^{r} \frac{Q}{4\pi\varepsilon_0 r'^2}\,dr' = \left.\frac{Q}{4\pi\varepsilon_0 r'}\right|_{\infty}^{r} = \frac{Q}{4\pi\varepsilon_0 r} \tag{2.7} $$

This, by definition, is the potential at P due to the charge Q:

$$ V(r) = \frac{1}{4\pi\varepsilon_0}\frac{Q}{r} \tag{2.8} $$


Equation (2.8) is true for any sign of the charge Q, though we considered 
Q > 0 in its derivation. For Q < 0, V < 0, i.e., work done (by the external
force) per unit positive test charge in bringing it from infinity to the point is
negative. This is equivalent to saying that work done by the electrostatic
force in bringing the unit positive charge from infinity to the point P is
positive. [This is as it should be, since for Q < 0, the force on a unit 
positive test charge is attractive, so that the electrostatic force and the 
displacement (from infinity to P) are in the same direction.] Finally, we 
note that Eq. (2.8) is consistent with the choice that potential at infinity be 
zero. 



Figure 2.4 Variation of potential V with r [in units of $(Q/4\pi\varepsilon_0)\ \mathrm{m}^{-1}$] (blue curve) and field with r [in
units of $(Q/4\pi\varepsilon_0)\ \mathrm{m}^{-2}$] (black curve) for a point charge Q.

Figure (2.4) shows how the electrostatic potential (∝ 1/r) and the
electrostatic field (∝ 1/r²) vary with r.

Example 2.1 

(a) Calculate the potential at a point P due to a charge of $4 \times 10^{-7}$ C
located 9 cm away.

(b) Hence obtain the work done in bringing a charge of $2 \times 10^{-9}$ C
from infinity to the point P. Does the answer depend on the path along
which the charge is brought?


Solution 


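(a) $$ V = \frac{1}{4\pi\varepsilon_0}\frac{Q}{r} = 9 \times 10^{9}\ \mathrm{N\,m^2\,C^{-2}} \times \frac{4 \times 10^{-7}\ \mathrm{C}}{0.09\ \mathrm{m}} = 4 \times 10^{4}\ \mathrm{V} $$

(b) $$ W = qV = 2 \times 10^{-9}\ \mathrm{C} \times 4 \times 10^{4}\ \mathrm{V} = 8 \times 10^{-5}\ \mathrm{J} $$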

No, work done will be path independent. Any arbitrary infinitesimal 
path can be resolved into two perpendicular displacements: One 
along r and another perpendicular to r. The work done corresponding 
to the latter will be zero.
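
The arithmetic of Example 2.1 can be checked with a short Python sketch. The function name `point_charge_potential` and the constant names are our own illustrative choices, and the sketch uses the unrounded value of 1/4πε₀, so the results come out slightly below the rounded figures quoted in the solution.

```python
import math

EPS0 = 8.8541878128e-12            # permittivity of free space, F/m
K = 1.0 / (4.0 * math.pi * EPS0)   # 1/(4*pi*eps0), about 8.99e9 N m^2 C^-2


def point_charge_potential(Q, r):
    """Potential in volts at distance r (m) from a point charge Q (C), Eq. (2.8).

    The potential is taken to be zero at infinity.
    """
    if r <= 0:
        raise ValueError("r must be positive")
    return K * Q / r


# Example 2.1(a): Q = 4e-7 C, r = 9 cm
V_P = point_charge_potential(4e-7, 0.09)

# Example 2.1(b): work done in bringing q = 2e-9 C from infinity to P
W = 2e-9 * V_P

print(f"V_P = {V_P:.3e} V")
print(f"W   = {W:.3e} J")
```

The printed values should agree with 4 × 10⁴ V and 8 × 10⁻⁵ J to within the rounding of the constant.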


2.4 Potential due to an Electric Dipole 

As we learnt in the last chapter, an electric dipole consists of two charges
q and −q separated by a (small) distance 2a. Its total charge is zero. It is
characterised by a dipole moment vector p whose magnitude is q × 2a
and which points in the direction from −q to q (Fig. 2.5). We also saw that
the electric field of a dipole at a point with position vector r depends not
just on the magnitude r, but also on the angle between r and p. Further,
the field falls off, at large distance, not as 1/r² (typical of field due to a
single charge) but as 1/r³. We now determine the electric potential due
to a dipole and contrast it with the potential due to a single charge.

As before, we take the origin at the centre of the dipole. Now we know
that the electric field obeys the superposition principle. Since potential is
related to the work done by the field, electrostatic potential also follows
the superposition principle. Thus, the potential due to the dipole is the
sum of potentials due to the charges q and −q


$$ V = \frac{1}{4\pi\varepsilon_0}\left(\frac{q}{r_1} - \frac{q}{r_2}\right) \tag{2.9} $$

where $r_1$ and $r_2$ are the distances of the point P from q and −q,
respectively.


Figure 2.5 Quantities involved in the calculation of potential due to a dipole.

Now, by geometry,

$$ r_1^2 = r^2 + a^2 - 2ar\cos\theta $$

$$ r_2^2 = r^2 + a^2 + 2ar\cos\theta \tag{2.10} $$


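We take r much greater than a (r >> a) and retain terms only up to the
first order in a/r:

$$ r_1^2 = r^2\left(1 - \frac{2a\cos\theta}{r} + \frac{a^2}{r^2}\right) \cong r^2\left(1 - \frac{2a\cos\theta}{r}\right) \tag{2.11} $$

Similarly,

$$ r_2^2 \cong r^2\left(1 + \frac{2a\cos\theta}{r}\right) \tag{2.12} $$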


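Using the binomial theorem and retaining terms up to the first order in a/r,
we obtain

$$ \frac{1}{r_1} \cong \frac{1}{r}\left(1 - \frac{2a\cos\theta}{r}\right)^{-1/2} \cong \frac{1}{r}\left(1 + \frac{a}{r}\cos\theta\right) \tag{2.13(a)} $$

$$ \frac{1}{r_2} \cong \frac{1}{r}\left(1 + \frac{2a\cos\theta}{r}\right)^{-1/2} \cong \frac{1}{r}\left(1 - \frac{a}{r}\cos\theta\right) \tag{2.13(b)} $$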

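Using Eqs. (2.9) and (2.13) and p = 2qa, we get

$$ V = \frac{q}{4\pi\varepsilon_0}\frac{2a\cos\theta}{r^2} = \frac{p\cos\theta}{4\pi\varepsilon_0 r^2} \tag{2.14} $$

Now, $p\cos\theta = \mathbf{p}\cdot\hat{\mathbf{r}}$,

where $\hat{\mathbf{r}}$ is the unit vector along the position vector OP.
The electric potential of a dipole is then given by

$$ V = \frac{1}{4\pi\varepsilon_0}\frac{\mathbf{p}\cdot\hat{\mathbf{r}}}{r^2}; \quad (r \gg a) \tag{2.15} $$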


Equation (2.15) is, as indicated, approximately true only for distances 
large compared to the size of the dipole, so that higher order terms in 
a/r are negligible. For a point dipole p at the origin, Eq. (2.15) is, however,
exact.

From Eq. (2.15), potential on the dipole axis (θ = 0, π) is given by

$$ V = \pm\frac{1}{4\pi\varepsilon_0}\frac{p}{r^2} \tag{2.16} $$

(Positive sign for θ = 0, negative sign for θ = π.) The potential in the
equatorial plane (θ = π/2) is zero.

The important contrasting features of electric potential of a dipole from 
that due to a single charge are clear from Eqs. (2.8) and (2.15): 

(i) The potential due to a dipole depends not just on r but also on the
angle between the position vector r and the dipole moment vector p. (It is,
however, axially symmetric about p. That is, if you rotate the position
vector r about p, keeping θ fixed, the points corresponding to P on the
cone so generated will have the same potential as at P.)

(ii) The electric dipole potential falls off, at large distance, as 1/r², not as
1/r, characteristic of the potential due to a single charge. (You can refer to
Fig. 2.4 for graphs of 1/r² versus r and 1/r versus r, drawn there in
another context.)
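
To see how good the far-field formula of Eq. (2.15) is, the following Python sketch compares it with the exact two-charge sum of Eq. (2.9). The function names and the numerical values of q and a are illustrative assumptions, not part of the text; the dipole is taken along the direction θ = 0 with +q at distance a from the origin.

```python
import math

K = 8.9875517923e9  # 1/(4*pi*eps0) in N m^2 C^-2


def dipole_potential_exact(q, a, r, theta):
    """Exact potential of charges +q and -q separated by 2a, Eqs. (2.9) and (2.10)."""
    r1 = math.sqrt(r * r + a * a - 2.0 * a * r * math.cos(theta))  # distance from +q
    r2 = math.sqrt(r * r + a * a + 2.0 * a * r * math.cos(theta))  # distance from -q
    return K * q * (1.0 / r1 - 1.0 / r2)


def dipole_potential_far(q, a, r, theta):
    """Far-field approximation p cos(theta) / (4 pi eps0 r^2), Eq. (2.15)."""
    p = 2.0 * q * a
    return K * p * math.cos(theta) / (r * r)


def relative_error(q, a, r, theta):
    """Relative difference between the approximate and exact potentials."""
    exact = dipole_potential_exact(q, a, r, theta)
    return abs(dipole_potential_far(q, a, r, theta) - exact) / abs(exact)


q, a = 1e-9, 1e-3  # illustrative values: 1 nC charges, 2 mm apart
for ratio in (5, 10, 100):
    err = relative_error(q, a, ratio * a, 0.0)
    print(f"r/a = {ratio:4d}: relative error on the axis = {err:.2e}")
```

On the axis the relative error shrinks roughly as (a/r)², which is why Eq. (2.15) is trustworthy only for r >> a.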


2.5 Potential due to a System of Charges 

Consider a system of charges $q_1, q_2, \ldots, q_n$ with position vectors $\mathbf{r}_1, \mathbf{r}_2, \ldots, \mathbf{r}_n$
relative to some origin (Fig. 2.6). The potential $V_1$ at P due to the
charge $q_1$ is

$$ V_1 = \frac{1}{4\pi\varepsilon_0}\frac{q_1}{r_{1P}} $$

where $r_{1P}$ is the distance between $q_1$ and P.




Similarly, the potential $V_2$ at P due to $q_2$ and $V_3$ due to $q_3$ are given by

$$ V_2 = \frac{1}{4\pi\varepsilon_0}\frac{q_2}{r_{2P}}, \qquad V_3 = \frac{1}{4\pi\varepsilon_0}\frac{q_3}{r_{3P}} $$

where $r_{2P}$ and $r_{3P}$ are the distances of P from charges $q_2$ and $q_3$,
respectively; and so on for the potential due to other charges. By the
superposition principle, the potential V at P due to the total charge
configuration is the algebraic sum of the potentials due to the individual
charges

$$ V = V_1 + V_2 + \cdots + V_n \tag{2.17} $$

$$ = \frac{1}{4\pi\varepsilon_0}\left(\frac{q_1}{r_{1P}} + \frac{q_2}{r_{2P}} + \cdots + \frac{q_n}{r_{nP}}\right) \tag{2.18} $$


If we have a continuous charge distribution characterised by a charge
density ρ(r), we divide it, as before, into small volume elements each of
size Δv and carrying a charge ρΔv. We then determine the potential due
to each volume element and sum (strictly speaking, integrate) over all
such contributions, and thus determine the potential due to the entire
distribution.





Figure 2.6 Potential at a point due to a system of charges is the sum of potentials due to individual
charges.











We have seen in Chapter 1 that for a uniformly charged spherical shell, 
the electric field outside the shell is as if the entire charge is concentrated 
at the centre. Thus, the potential outside the shell is given by 

$$ V = \frac{1}{4\pi\varepsilon_0}\frac{q}{r} \quad (r \ge R) \tag{2.19(a)} $$

where q is the total charge on the shell and R its radius. The electric field 
inside the shell is zero. This implies (Section 2.6) that potential is 
constant inside the shell (as no work is done in moving a charge inside 
the shell), and, therefore, equals its value at the surface, which is 

$$ V = \frac{1}{4\pi\varepsilon_0}\frac{q}{R} \tag{2.19(b)} $$
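
The two results for the charged shell, Eqs. 2.19(a) and 2.19(b), can be combined into one small function. The sketch below is only an illustration; the name `shell_potential` and the sample numbers are our own assumptions.

```python
K = 8.9875517923e9  # 1/(4*pi*eps0) in N m^2 C^-2


def shell_potential(q, R, r):
    """Potential at distance r from the centre of a uniformly charged shell.

    q is the total charge (C) and R the shell radius (m).
    Outside (r >= R) the shell acts like a point charge at the centre;
    inside (r < R) the potential stays at its surface value.
    """
    if R <= 0 or r < 0:
        raise ValueError("R must be positive and r non-negative")
    return K * q / max(r, R)


q, R = 1e-9, 0.10  # illustrative values: 1 nC on a shell of radius 10 cm
for r in (0.0, 0.05, 0.10, 0.20, 0.40):
    print(f"r = {r:.2f} m: V = {shell_potential(q, R, r):.2f} V")
```

Using `max(r, R)` encodes the fact that the potential is constant inside the shell and falls off as 1/r outside it.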

Example 2.2 Two charges $3 \times 10^{-8}$ C and $-2 \times 10^{-8}$ C are located
15 cm apart. At what point on the line joining the two charges is the
electric potential zero? Take the potential at infinity to be zero.

Solution Let us take the origin O at the location of the positive 
charge. The line joining the two charges is taken to be the x-axis; the 
negative charge is taken to be on the right side of the origin (Fig. 2.7). 

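Figure 2.7 [Charge $3 \times 10^{-8}$ C at O and charge $-2 \times 10^{-8}$ C at A, 15 cm apart on the x-axis; P is a point on the axis at coordinate x.]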

Let P be the required point on the x-axis where the potential is zero. If
x is the x-coordinate of P, obviously x must be positive. (There is no
possibility of potentials due to the two charges adding up to zero for
x < 0.) If x lies between O and A, we have

$$ \frac{1}{4\pi\varepsilon_0}\left[\frac{3 \times 10^{-8}}{x \times 10^{-2}} - \frac{2 \times 10^{-8}}{(15 - x) \times 10^{-2}}\right] = 0 $$

where x is in cm. That is,

$$ \frac{3}{x} - \frac{2}{15 - x} = 0 $$

which gives x = 9 cm.

If x lies on the extended line OA, the required condition is

$$ \frac{3}{x} - \frac{2}{x - 15} = 0 $$

which gives

x = 45 cm


Thus, electric potential is zero at 9 cm and 45 cm away from the 
positive charge on the side of the negative charge. Note that the 
formula for potential used in the calculation required choosing 
potential to be zero at infinity. 
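
The two zero-potential points of Example 2.2 can also be located numerically. The sketch below sums the potentials of the two charges as in Eq. (2.18) and searches for sign changes by bisection; the helper names `potential_on_axis` and `bisect_root`, and the search intervals, are our own illustrative choices.

```python
K = 8.9875517923e9  # 1/(4*pi*eps0) in N m^2 C^-2

# (charge in C, x-position in m): positive charge at O, negative charge at A
CHARGES = [(3e-8, 0.0), (-2e-8, 0.15)]


def potential_on_axis(x):
    """Potential at position x (m) on the line of the charges, by superposition."""
    return sum(K * q / abs(x - xq) for q, xq in CHARGES)


def bisect_root(f, lo, hi, tol=1e-12, max_iter=200):
    """Find a root of f in [lo, hi] by bisection; f(lo) and f(hi) must differ in sign."""
    f_lo, f_hi = f(lo), f(hi)
    if f_lo * f_hi > 0:
        raise ValueError("f must change sign on the interval")
    for _ in range(max_iter):
        mid = 0.5 * (lo + hi)
        f_mid = f(mid)
        if f_mid == 0.0 or hi - lo < tol:
            return mid
        if f_lo * f_mid < 0:
            hi = mid
        else:
            lo, f_lo = mid, f_mid
    return 0.5 * (lo + hi)


x_between = bisect_root(potential_on_axis, 0.01, 0.14)  # between O and A
x_beyond = bisect_root(potential_on_axis, 0.16, 1.00)   # beyond A

print(f"zero between the charges: x = {100 * x_between:.2f} cm")
print(f"zero beyond the negative charge: x = {100 * x_beyond:.2f} cm")
```

The search intervals avoid the positions of the charges themselves, where the potential diverges.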


PHYSICS 

Electric potential, equipotential surfaces: 

http://video.mit.edu/watch/4-electrostatic-potential-elctric-energy-ev-conservative-field-equipotential-sufaces-12584/


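Example 2.3 Figures 2.8 (a) and (b) show the field lines of a positive
and negative point charge respectively.

Figure 2.8 [(a) Field lines of a positive point charge, with points P and Q marked; (b) field lines of a negative point charge, with points A and B marked.]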

(a) Give the signs of the potential difference $V_P - V_Q$; $V_B - V_A$.

(b) Give the sign of the potential energy difference of a small negative 
charge between the points Q and P; A and B. 

(c) Give the sign of the work done by the field in moving a small 
positive charge from Q to P. 

(d) Give the sign of the work done by the external agency in moving a 
small negative charge from B to A. 

(e) Does the kinetic energy of a small negative charge increase or 
decrease in going from B to A? 

Solution 

(a) As V ∝ 1/r, $V_P > V_Q$. Thus, $(V_P - V_Q)$ is positive. Also $V_B$ is less
negative than $V_A$. Thus, $V_B > V_A$ or $(V_B - V_A)$ is positive.

(b) A small negative charge will be attracted towards positive charge.
The negative charge moves from higher potential energy to lower
potential energy. Therefore the sign of potential energy difference of a
small negative charge between Q and P is positive.

Similarly, $(\mathrm{P.E.})_A > (\mathrm{P.E.})_B$ and hence sign of potential energy
differences is positive.

(c) In moving a small positive charge from Q to P, work has to be 
done by an external agency against the electric field. Therefore, work 
done by the field is negative. 

(d) In moving a small negative charge from B to A work has to be 
done by the external agency. It is positive. 

(e) Due to force of repulsion on the negative charge, velocity 
decreases and hence the kinetic energy decreases in going from B to 
A. 


2.6 Equipotential Surfaces 

An equipotential surface is a surface with a constant value of potential at 
all points on the surface. For a single charge q, the potential is given by 
Eq. (2.8): 

$$ V = \frac{1}{4\pi\varepsilon_0}\frac{q}{r} $$

This shows that V is a constant if r is constant. Thus, equipotential 
surfaces of a single point charge are concentric spherical surfaces 
centred at the charge. 

Now the electric field lines for a single charge q are radial lines starting 
from or ending at the charge, depending on whether q is positive or
negative. Clearly, the electric field at every point is normal to the
equipotential surface passing through that point. This is true in general: 
for any charge configuration, equipotential surface through a point is 
normal to the electric field at that point. The proof of this statement is 
simple. 

If the field were not normal to the equipotential surface, it would have 
non-zero component along the surface. To move a unit test charge 
against the direction of the component of the field, work would have to be 
done. But this is in contradiction to the definition of an equipotential 
surface: there is no potential difference between any two points on the 
surface and no work is required to move a test charge on the surface. 
The electric field must, therefore, be normal to the equipotential surface 
at every point. Equipotential surfaces offer an alternative visual picture in 
addition to the picture of electric field lines around a charge configuration. 







Figure 2.9 For a single charge q (a) equipotential surfaces are spherical surfaces centred at the 
charge, and (b) electric field lines are radial, starting from the charge if q > 0. 



Figure 2.10 Equipotential surfaces for a uniform electric field.

For a uniform electric field E, say, along the x-axis, the equipotential
surfaces are planes normal to the x-axis, i.e., planes parallel to the y-z
plane (Fig. 2.10). Equipotential surfaces for (a) a dipole and (b) two
identical positive charges are shown in Fig. 2.11.




Figure 2.11 Some equipotential surfaces for (a) a dipole, (b) two identical positive charges. 


2.6.1 Relation between field and potential 

Consider two closely spaced equipotential surfaces A and B (Fig. 2.12)
with potential values V and V + δV, where δV is the change in V in the
direction of the electric field E. Let P be a point on the surface B. δl is the
perpendicular distance of the surface A from P. Imagine that a unit
positive charge is moved along this perpendicular from the surface B to
surface A against the electric field. The work done in this process is |E| δl.

This work equals the potential difference $V_A - V_B$.

Thus,

$$ |\mathbf{E}|\,\delta l = V - (V + \delta V) = -\delta V $$

i.e.,

$$ |\mathbf{E}| = -\frac{\delta V}{\delta l} \tag{2.20} $$

Since δV is negative, δV = −|δV|, and we can rewrite Eq. (2.20) as

$$ |\mathbf{E}| = -\frac{\delta V}{\delta l} = +\frac{|\delta V|}{\delta l} \tag{2.21} $$



Figure 2.12 From the potential to the field.

We thus arrive at two important conclusions concerning the relation
between electric field and potential:

(i) Electric field is in the direction in which the potential decreases
steepest.

(ii) Its magnitude is given by the change in the magnitude of potential per
unit displacement normal to the equipotential surface at the point.
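
The relation between field and potential can be tried out numerically. In the sketch below, which is only an illustration, we take the potential of a point charge and estimate the field as minus the change in V per unit radial displacement, using a small central difference; the function names and the sample values of Q, r and h are our own assumptions.

```python
K = 8.9875517923e9  # 1/(4*pi*eps0) in N m^2 C^-2
Q = 4e-7            # illustrative point charge in C


def potential(r):
    """Potential of the point charge Q at distance r, Eq. (2.8)."""
    return K * Q / r


def field_from_potential(V, r, h):
    """Estimate the radial field as -(change in V) / (displacement), for small h."""
    return -(V(r + h) - V(r - h)) / (2.0 * h)


r = 0.09                 # distance in m
h = 1e-5 * r             # small displacement along the radial direction
E_numeric = field_from_potential(potential, r, h)
E_exact = K * Q / r**2   # Coulomb field of the point charge

print(f"E from potential: {E_numeric:.6e} V/m")
print(f"E from Coulomb:   {E_exact:.6e} V/m")
```

The estimate is positive for Q > 0, i.e., the field points outward, in the direction in which the potential decreases.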

2.7 Potential Energy of a System of Charges 

Consider first the simple case of two charges $q_1$ and $q_2$ with position
vectors $\mathbf{r}_1$ and $\mathbf{r}_2$ relative to some origin. Let us calculate the work done
(externally) in building up this configuration. This means that we consider
the charges $q_1$ and $q_2$ initially at infinity and determine the work done by
an external agency to bring the charges to the given locations. Suppose,
first the charge $q_1$ is brought from infinity to the point $\mathbf{r}_1$. There is no
external field against which work needs to be done, so work done in
bringing $q_1$ from infinity to $\mathbf{r}_1$ is zero. This charge produces a potential in
space given by

$$ V_1 = \frac{1}{4\pi\varepsilon_0}\frac{q_1}{r_{1P}} $$

where $r_{1P}$ is the distance of a point P in space from the location of $q_1$.
From the definition of potential, work done in bringing charge $q_2$ from
infinity to the point $\mathbf{r}_2$ is $q_2$ times the potential at $\mathbf{r}_2$ due to $q_1$:

$$ \text{work done on } q_2 = \frac{1}{4\pi\varepsilon_0}\frac{q_1 q_2}{r_{12}} $$

where $r_{12}$ is the distance between points 1 and 2.


Since electrostatic force is conservative, this work gets stored in the form
of potential energy of the system. Thus, the potential energy of a system
of two charges $q_1$ and $q_2$ is

$$ U = \frac{1}{4\pi\varepsilon_0}\frac{q_1 q_2}{r_{12}} \tag{2.22} $$

Obviously, if $q_2$ was brought first to its present location and $q_1$ brought
later, the potential energy U would be the same. More generally, the
potential energy expression, Eq. (2.22), is unaltered whatever way the
charges are brought to the specified locations, because of
path-independence of work for electrostatic force.

Equation (2.22) is true for any sign of $q_1$ and $q_2$. If $q_1 q_2 > 0$, potential
energy is positive. This is as expected, since for like charges ($q_1 q_2 > 0$),
electrostatic force is repulsive and a positive amount of work is needed to
be done against this force to bring the charges from infinity to a finite
distance apart. For unlike charges ($q_1 q_2 < 0$), the electrostatic force is
attractive. In that case, a positive amount of work is needed against this
force to take the charges from the given location to infinity. In other
words, a negative amount of work is needed for the reverse path (from
infinity to the present locations), so the potential energy is negative.

Figure 2.13 Potential energy of a system of charges $q_1$ and $q_2$ is directly proportional to the
product of charges and inversely to the distance between them.

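Equation (2.22) is easily generalised for a system of any number of point
charges. Let us calculate the potential energy of a system of three
charges $q_1$, $q_2$ and $q_3$ located at $\mathbf{r}_1$, $\mathbf{r}_2$, $\mathbf{r}_3$, respectively. To bring $q_1$ first
from infinity to $\mathbf{r}_1$, no work is required. Next we bring $q_2$ from infinity to $\mathbf{r}_2$.
As before, work done in this step is

$$ q_2 V_1(\mathbf{r}_2) = \frac{1}{4\pi\varepsilon_0}\frac{q_1 q_2}{r_{12}} \tag{2.23} $$

The charges $q_1$ and $q_2$ produce a potential, which at any point P is given
by

$$ V_{1,2} = \frac{1}{4\pi\varepsilon_0}\left(\frac{q_1}{r_{1P}} + \frac{q_2}{r_{2P}}\right) \tag{2.24} $$

Work done next in bringing $q_3$ from infinity to the point $\mathbf{r}_3$ is $q_3$ times $V_{1,2}$
at $\mathbf{r}_3$:

$$ q_3 V_{1,2}(\mathbf{r}_3) = \frac{1}{4\pi\varepsilon_0}\left(\frac{q_1 q_3}{r_{13}} + \frac{q_2 q_3}{r_{23}}\right) \tag{2.25} $$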


The total work done in assembling the charges at the given locations is
obtained by adding the work done in different steps [Eq. (2.23) and Eq.
(2.25)],

$$ U = \frac{1}{4\pi\varepsilon_0}\left(\frac{q_1 q_2}{r_{12}} + \frac{q_1 q_3}{r_{13}} + \frac{q_2 q_3}{r_{23}}\right) \tag{2.26} $$


Again, because of the conservative nature of the electrostatic force (or 
equivalently, the path independence of work done), the final expression 
for U, Eq. (2.26), is independent of the manner in which the configuration 
is assembled. The potential energy is characteristic of the present state 
of configuration, and not the way the state is achieved. 
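
The claim that U does not depend on the order of assembly can be tested directly. The Python sketch below adds up the work step by step, exactly as in the derivation above, for every possible order of bringing in three charges, and compares each result with the pairwise sum for three charges. The function name `assembly_work` and the sample charges and positions are illustrative assumptions.

```python
import itertools
import math

K = 8.9875517923e9  # 1/(4*pi*eps0) in N m^2 C^-2


def assembly_work(charges, positions, order):
    """Work done in bringing the charges from infinity one at a time, in the given order."""
    placed = []   # indices of the charges already in position
    total = 0.0
    for i in order:
        # potential at the new charge's location due to the charges already placed
        V = sum(K * charges[j] / math.dist(positions[i], positions[j]) for j in placed)
        total += charges[i] * V
        placed.append(i)
    return total


# Illustrative configuration: three charges (C) at three points (m)
charges = [1e-6, -2e-6, 3e-6]
positions = [(0.0, 0.0, 0.0), (0.1, 0.0, 0.0), (0.0, 0.2, 0.0)]

# Pairwise sum over the three pairs of charges
r12 = math.dist(positions[0], positions[1])
r13 = math.dist(positions[0], positions[2])
r23 = math.dist(positions[1], positions[2])
U_pairs = K * (charges[0] * charges[1] / r12
               + charges[0] * charges[2] / r13
               + charges[1] * charges[2] / r23)

works = {order: assembly_work(charges, positions, order)
         for order in itertools.permutations(range(3))}

for order, w in works.items():
    print(order, f"{w:.6f} J")
print("pairwise sum:", f"{U_pairs:.6f} J")
```

All six orders should give the same energy, up to floating-point rounding.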



Figure 2.14 Potential energy of a system of three charges is given by Eq. (2.26), with the notation
given in the figure.

Example 2.4 Four charges are arranged at the corners of a square
ABCD of side d, as shown in Fig. 2.15. (a) Find the work required to
put together this arrangement. (b) A charge $q_0$ is brought to the
centre E of the square, the four charges being held fixed at its
corners. How much extra work is needed to do this?



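Figure 2.15 [Square ABCD of side d with charges +q, −q, +q and −q at the corners A, B, C and D, respectively; E is the centre of the square.]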

Solution 

(a) Since the work done depends on the final arrangement of the 
charges, and not on how they are put together, we calculate work 
needed for one way of putting the charges at A, B, C and D. 
Suppose, first the charge +q is brought to A, and then the charges 
-q, +q, and -q are brought to B, C and D, respectively. The total work 
needed can be calculated in steps: 

(i) Work needed to bring charge +q to A when no charge is present 
elsewhere: this is zero. 

(ii) Work needed to bring −q to B when +q is at A. This is given by
(charge at B) × (electrostatic potential at B due to charge +q at A)

$$ = -q \times \left(\frac{q}{4\pi\varepsilon_0 d}\right) = -\frac{q^2}{4\pi\varepsilon_0 d} $$

(iii) Work needed to bring charge +q to C when +q is at A and −q is at
B. This is given by (charge at C) × (potential at C due to charges at A
and B)

$$ = +q\left(\frac{+q}{4\pi\varepsilon_0 d\sqrt{2}} + \frac{-q}{4\pi\varepsilon_0 d}\right) = \frac{-q^2}{4\pi\varepsilon_0 d}\left(1 - \frac{1}{\sqrt{2}}\right) $$

(iv) Work needed to bring −q to D when +q is at A, −q at B, and +q at C.
This is given by (charge at D) × (potential at D due to charges at A, B
and C)

$$ = -q\left(\frac{+q}{4\pi\varepsilon_0 d} + \frac{-q}{4\pi\varepsilon_0 d\sqrt{2}} + \frac{q}{4\pi\varepsilon_0 d}\right) = \frac{-q^2}{4\pi\varepsilon_0 d}\left(2 - \frac{1}{\sqrt{2}}\right) $$

Add the work done in steps (i), (ii), (iii) and (iv). The total work
required is

$$ = \frac{-q^2}{4\pi\varepsilon_0 d}\left\{(0) + (1) + \left(1 - \frac{1}{\sqrt{2}}\right) + \left(2 - \frac{1}{\sqrt{2}}\right)\right\} = \frac{-q^2}{4\pi\varepsilon_0 d}\left(4 - \sqrt{2}\right) $$

The work done depends only on the arrangement of the charges, and
not how they are assembled. By definition, this is the total
electrostatic energy of the charges.































(Students may try calculating same work/energy by taking charges in 
any other order they desire and convince themselves that the energy 
will remain the same.) 

(b) The extra work necessary to bring a charge $q_0$ to the point E when
the four charges are at A, B, C and D is $q_0$ × (electrostatic potential at
E due to the charges at A, B, C and D). The electrostatic potential at
E is clearly zero since potential due to A and C is cancelled by that
due to B and D. Hence, no work is required to bring any charge to
point E.
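
Both parts of Example 2.4 can be verified with a pairwise sum over the four corner charges. In the sketch below the numerical values of q and d are arbitrary illustrative choices, and the helper names `pair_energy` and `potential_at` are our own; the closed form −(4 − √2)q²/(4πε₀d) is the one that follows from the steps of the solution.

```python
import math

K = 8.9875517923e9  # 1/(4*pi*eps0) in N m^2 C^-2


def pair_energy(charges, positions):
    """Electrostatic energy as the sum of K*qi*qj/rij over all distinct pairs."""
    U = 0.0
    n = len(charges)
    for i in range(n):
        for j in range(i + 1, n):
            U += K * charges[i] * charges[j] / math.dist(positions[i], positions[j])
    return U


def potential_at(point, charges, positions):
    """Potential at a point due to all the charges, by superposition."""
    return sum(K * q / math.dist(point, pos) for q, pos in zip(charges, positions))


q, d = 1e-6, 0.1  # illustrative values: 1 microcoulomb charges, 10 cm side
corners = [(0.0, 0.0), (d, 0.0), (d, d), (0.0, d)]  # A, B, C, D in order
charges = [q, -q, q, -q]                            # +q, -q, +q, -q

U = pair_energy(charges, corners)
U_formula = -(4.0 - math.sqrt(2.0)) * K * q * q / d
V_E = potential_at((d / 2, d / 2), charges, corners)  # potential at the centre E

print(f"pairwise energy: {U:.6f} J")
print(f"closed form:     {U_formula:.6f} J")
print(f"potential at E:  {V_E:.3e} V")
```

Since the potential at E vanishes, the extra work q₀ × V at E is zero for any q₀.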


2.8 Potential Energy in an External Field 

2.8.1 Potential energy of a single charge 

In Section 2.7, the source of the electric field was specified - the charges 
and their locations - and the potential energy of the system of those 
charges was determined. In this section, we ask a related but a distinct 
question. What is the potential energy of a charge q in a given field? This 
question was, in fact, the starting point that led us to the notion of the 
electrostatic potential (Sections 2.1 and 2.2). But here we address this 
question again to clarify in what way it is different from the discussion in 
Section 2.7. 

The main difference is that we are now concerned with the potential 
energy of a charge (or charges) in an external field. The external field E 
is not produced by the given charge(s) whose potential energy we wish to 
calculate. E is produced by sources external to the given charge(s). The
external sources may be known, but often they are unknown or
unspecified; what is specified is the electric field E or the electrostatic
potential V due to the external sources. We assume that the charge q
does not significantly affect the sources producing the external field. This 
is true if q is very small, or the external sources are held fixed by other 
unspecified forces. Even if q is finite, its influence on the external sources 
may still be ignored in the situation when very strong sources far away at 
infinity produce a finite field E in the region of interest. Note again that we 
are interested in determining the potential energy of a given charge q 
(and later, a system of charges) in the external field; we are not 
interested in the potential energy of the sources producing the external 
electric field. 

The external electric field E and the corresponding external potential V 
may vary from point to point. By definition, V at a point P is the work done 
in bringing a unit positive charge from infinity to the point P. 

(We continue to take potential at infinity to be zero.) Thus, work done in 
bringing a charge q from infinity to the point P in the external field is qV. 
This work is stored in the form of potential energy of q. If the point P has 
position vector r relative to some origin, we can write: 

Potential energy of q at r in an external field 

= qV(r) (2.27) 

where V(r) is the external potential at the point r. 

Thus, if an electron with charge $q = e = 1.6 \times 10^{-19}$ C is accelerated by a
potential difference of ΔV = 1 volt, it would gain energy of $q\Delta V = 1.6 \times 10^{-19}$ J.
This unit of energy is defined as 1 electron volt or 1 eV, i.e.,
1 eV = $1.6 \times 10^{-19}$ J. The units based on eV are most commonly used in
atomic, nuclear and particle physics (1 keV = $10^{3}$ eV = $1.6 \times 10^{-16}$ J,
1 MeV = $10^{6}$ eV = $1.6 \times 10^{-13}$ J, 1 GeV = $10^{9}$ eV = $1.6 \times 10^{-10}$ J and
1 TeV = $10^{12}$ eV = $1.6 \times 10^{-7}$ J). [This has already been defined on Page 117, XI Physics
Part I, Table 6.1.]
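
The conversions between electron volts and joules listed above amount to one multiplication. A minimal Python sketch follows; the function name `to_joule` and the dictionary of prefixes are our own illustrative choices, and the elementary charge is entered with more digits than the rounded 1.6 × 10⁻¹⁹ C used in the text.

```python
E_CHARGE = 1.602176634e-19  # magnitude of the electron charge in C

# multiples of the electron volt mentioned in the text
PREFIXES = {"eV": 1.0, "keV": 1e3, "MeV": 1e6, "GeV": 1e9, "TeV": 1e12}


def to_joule(value, unit="eV"):
    """Convert an energy given in eV, keV, MeV, GeV or TeV to joules."""
    return value * PREFIXES[unit] * E_CHARGE


for unit in PREFIXES:
    print(f"1 {unit} = {to_joule(1.0, unit):.3e} J")
```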

2.8.2 Potential energy of a system of two charges in an 
external field 

Next, we ask: what is the potential energy of a system of two charges $q_1$
and $q_2$ located at $\mathbf{r}_1$ and $\mathbf{r}_2$, respectively, in an external field? First, we
calculate the work done in bringing the charge $q_1$ from infinity to $\mathbf{r}_1$. Work
done in this step is $q_1 V(\mathbf{r}_1)$, using Eq. (2.27). Next, we consider the work
done in bringing $q_2$ to $\mathbf{r}_2$. In this step, work is done not only against the
external field E but also against the field due to $q_1$.

Work done on $q_2$ against the external field

$$ = q_2 V(\mathbf{r}_2) $$

Work done on $q_2$ against the field due to $q_1$

$$ = \frac{q_1 q_2}{4\pi\varepsilon_0 r_{12}} $$

where $r_{12}$ is the distance between $q_1$ and $q_2$. We have made use of Eqs.
(2.27) and (2.22). By the superposition principle for fields, we add up the
work done on $q_2$ against the two fields (E and that due to $q_1$):

Work done in bringing $q_2$ to $\mathbf{r}_2$

$$ = q_2 V(\mathbf{r}_2) + \frac{q_1 q_2}{4\pi\varepsilon_0 r_{12}} \tag{2.28} $$

Thus,

Potential energy of the system
= the total work done in assembling the configuration

$$ = q_1 V(\mathbf{r}_1) + q_2 V(\mathbf{r}_2) + \frac{q_1 q_2}{4\pi\varepsilon_0 r_{12}} \tag{2.29} $$


Example 2.5 

(a) Determine the electrostatic potential energy of a system
consisting of two charges 7 µC and −2 µC (and with no external field)
placed at (−9 cm, 0, 0) and (9 cm, 0, 0) respectively.

(b) How much work is required to separate the two charges infinitely
away from each other?

(c) Suppose that the same system of charges is now placed in an
external electric field $E = A\,(1/r^2)$; $A = 9 \times 10^{5}\ \mathrm{N\,C^{-1}\,m^{2}}$. What would the
electrostatic energy of the configuration be?

Solution 


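(a) $$ U = \frac{1}{4\pi\varepsilon_0}\frac{q_1 q_2}{r} = 9 \times 10^{9} \times \frac{7 \times (-2) \times 10^{-12}}{0.18} = -0.7\ \mathrm{J} $$

(b) $W = U_2 - U_1 = 0 - U = 0 - (-0.7) = 0.7$ J.

(c) The mutual interaction energy of the two charges remains
unchanged. In addition, there is the energy of interaction of the two
charges with the external electric field. We find,

$$ q_1 V(\mathbf{r}_1) + q_2 V(\mathbf{r}_2) = A\,\frac{7\ \mu\mathrm{C}}{0.09\ \mathrm{m}} + A\,\frac{-2\ \mu\mathrm{C}}{0.09\ \mathrm{m}} $$

and the net electrostatic energy is

$$ q_1 V(\mathbf{r}_1) + q_2 V(\mathbf{r}_2) + \frac{q_1 q_2}{4\pi\varepsilon_0 r_{12}} = A\,\frac{7\ \mu\mathrm{C}}{0.09\ \mathrm{m}} + A\,\frac{-2\ \mu\mathrm{C}}{0.09\ \mathrm{m}} - 0.7\ \mathrm{J} = 70 - 20 - 0.7 = 49.3\ \mathrm{J} $$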
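
The numbers of Example 2.5 can be reproduced with Eq. (2.29). The sketch below makes two assumptions that should be kept in mind: it uses the rounded value 9 × 10⁹ N m² C⁻² for 1/4πε₀, as the worked solution does, and it takes the external potential to be V(r) = A/r, which is the potential (zero at infinity) of a radial field of magnitude A/r². The variable and function names are our own.

```python
import math

K = 9e9  # rounded 1/(4*pi*eps0) in N m^2 C^-2, as used in the worked solution
A = 9e5  # constant of the external field E = A / r^2, in N m^2 C^-1


def external_potential(r):
    """Assumed external potential V(r) = A / r for the radial field E = A / r^2."""
    return A / r


q1, q2 = 7e-6, -2e-6                            # charges in C
pos1, pos2 = (-0.09, 0.0, 0.0), (0.09, 0.0, 0.0)  # positions in m
origin = (0.0, 0.0, 0.0)

r1 = math.dist(pos1, origin)   # distance of q1 from the origin
r2 = math.dist(pos2, origin)   # distance of q2 from the origin
r12 = math.dist(pos1, pos2)    # separation of the two charges

U_mutual = K * q1 * q2 / r12                                      # part (a)
W_separate = 0.0 - U_mutual                                       # part (b)
U_external = q1 * external_potential(r1) + q2 * external_potential(r2)
U_total = U_external + U_mutual                                   # part (c), Eq. (2.29)

print(f"(a) U = {U_mutual:.2f} J")
print(f"(b) W = {W_separate:.2f} J")
print(f"(c) total energy = {U_total:.2f} J")
```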


2.8.3 Potential energy of a dipole in an external field 

Consider a dipole with charges $q_1 = +q$ and $q_2 = -q$ placed in a uniform
electric field E, as shown in Fig. 2.16.

As seen in the last chapter, in a uniform electric field, the dipole
experiences no net force; but experiences a torque τ given by

$$ \boldsymbol{\tau} = \mathbf{p} \times \mathbf{E} \tag{2.30} $$

which will tend to rotate it (unless p is parallel or
antiparallel to E). Suppose an external torque $\tau_{\mathrm{ext}}$ is applied in such a
manner that it just neutralises this torque and rotates it in the plane of
paper from angle $\theta_0$ to angle $\theta_1$ at an infinitesimal angular speed and
without angular acceleration. The amount of work done by the external
torque will be given by

$$ W = \int_{\theta_0}^{\theta_1} \tau_{\mathrm{ext}}(\theta)\,d\theta = \int_{\theta_0}^{\theta_1} pE\sin\theta\,d\theta = pE\left(\cos\theta_0 - \cos\theta_1\right) \tag{2.31} $$

This work is stored as the potential energy of the system. We can then
associate potential energy U(θ) with an inclination θ of the dipole. Similar
to other potential energies, there is a freedom in choosing the angle
where the potential energy U is taken to be zero. A natural choice is to
take $\theta_0 = \pi/2$. (An explanation for it is provided towards the end of
discussion.) We can then write,

$$ U(\theta) = pE\left(\cos\frac{\pi}{2} - \cos\theta\right) = -pE\cos\theta = -\mathbf{p}\cdot\mathbf{E} \tag{2.32} $$

Figure 2.16 Potential energy of a dipole in a uniform external field.


This expression can alternately be understood also from Eq. (2.29). We
apply Eq. (2.29) to the present system of two charges +q and −q. The
potential energy expression then reads

$$ U'(\theta) = q\left[V(\mathbf{r}_1) - V(\mathbf{r}_2)\right] - \frac{q^2}{4\pi\varepsilon_0 \times 2a} \tag{2.33} $$

Here, $\mathbf{r}_1$ and $\mathbf{r}_2$ denote the position vectors of +q and −q. Now, the
potential difference between positions $\mathbf{r}_1$ and $\mathbf{r}_2$ equals the work done
in bringing a unit positive charge against field from $\mathbf{r}_2$ to $\mathbf{r}_1$. The
displacement parallel to the force is 2a cos θ. Thus, $[V(\mathbf{r}_1) - V(\mathbf{r}_2)] = -E \times 2a\cos\theta$. We thus obtain,

$$ U'(\theta) = -pE\cos\theta - \frac{q^2}{4\pi\varepsilon_0 \times 2a} = -\mathbf{p}\cdot\mathbf{E} - \frac{q^2}{4\pi\varepsilon_0 \times 2a} \tag{2.34} $$











We note that U′(θ) differs from U(θ) by a quantity which is just a constant
for a given dipole. Since a constant is insignificant for potential energy,
we can drop the second term in Eq. (2.34) and it then reduces to Eq.
(2.32).

We can now understand why we took $\theta_0 = \pi/2$. In this case, the work done
against the external field E in bringing +q and −q are equal and opposite
and cancel out, i.e., $q[V(\mathbf{r}_1) - V(\mathbf{r}_2)] = 0$.
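
The work integral of Eq. (2.31) can be checked by adding up the work of the external torque, pE sin θ, over many small angular steps. The sketch below uses a simple midpoint rule; the function name, the step count and the sample values of p and E are our own illustrative assumptions.

```python
import math


def work_by_external_torque(p, E, theta0, theta1, n=10000):
    """Midpoint-rule estimate of the integral of p*E*sin(theta) from theta0 to theta1."""
    h = (theta1 - theta0) / n
    return h * sum(p * E * math.sin(theta0 + (i + 0.5) * h) for i in range(n))


p, E = 6e-6, 1e6            # illustrative values: p in C m, E in V/m
theta0 = math.pi / 2        # reference angle at which U is taken to be zero
theta1 = math.radians(60)   # final inclination of the dipole

W_numeric = work_by_external_torque(p, E, theta0, theta1)
W_formula = p * E * (math.cos(theta0) - math.cos(theta1))  # closed form, Eq. (2.31)
U_theta1 = -p * E * math.cos(theta1)                       # Eq. (2.32)

print(f"numerical work: {W_numeric:.6f} J")
print(f"closed form:    {W_formula:.6f} J")
print(f"U(theta1):      {U_theta1:.6f} J")
```

With the reference angle π/2, the work done equals the potential energy U(θ₁) of Eq. (2.32), as the three printed values should confirm.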

Example 2.6 A molecule of a substance has a permanent electric
dipole moment of magnitude $10^{-29}$ C m. A mole of this substance is
polarised (at low temperature) by applying a strong electrostatic field
of magnitude $10^{6}\ \mathrm{V\,m^{-1}}$. The direction of the field is suddenly
changed by an angle of 60°. Estimate the heat released by the
substance in aligning its dipoles along the new direction of the field.
For simplicity, assume 100% polarisation of the sample.

Solution Here, dipole moment of each molecule = $10^{-29}$ C m

As 1 mole of the substance contains $6 \times 10^{23}$ molecules,

total dipole moment of all the molecules, $p = 6 \times 10^{23} \times 10^{-29}$ C m
$= 6 \times 10^{-6}$ C m

Initial potential energy, $U_i = -pE\cos\theta = -6 \times 10^{-6} \times 10^{6} \cos 0° = -6$ J

Final potential energy (when θ = 60°), $U_f = -6 \times 10^{-6} \times 10^{6} \cos 60° = -3$ J

Change in potential energy = −3 J − (−6 J) = 3 J

So, just after the field direction is changed, the dipoles have 3 J more
potential energy than in the aligned state. This energy is lost as the
dipoles realign along the new direction of the field, and must be the energy
released by the substance in the form of heat.
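
The estimate of Example 2.6 takes only a few lines of Python. The sketch below uses the same rounded figure of 6 × 10²³ molecules per mole as the solution; the function name `dipole_energy` is our own.

```python
import math

N_MOLECULES = 6e23    # molecules in one mole, rounded as in the solution
P_MOLECULE = 1e-29    # dipole moment of one molecule in C m
E_FIELD = 1e6         # field magnitude in V/m


def dipole_energy(p, E, theta_deg):
    """Potential energy -p E cos(theta) of a dipole at angle theta (degrees) to the field."""
    return -p * E * math.cos(math.radians(theta_deg))


p_total = N_MOLECULES * P_MOLECULE              # total dipole moment for 100% polarisation

U_aligned = dipole_energy(p_total, E_FIELD, 0)  # dipoles along the field
U_tilted = dipole_energy(p_total, E_FIELD, 60)  # dipoles at 60 degrees to the field
heat = U_tilted - U_aligned                     # energy given up on realignment

print(f"U at 0 degrees:  {U_aligned:.2f} J")
print(f"U at 60 degrees: {U_tilted:.2f} J")
print(f"heat released:   {heat:.2f} J")
```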



2.9 Electrostatics of Conductors 

Conductors and insulators were described briefly in Chapter 1. 
Conductors contain mobile charge carriers. In metallic conductors, these 
charge carriers are electrons. In a metal, the outer (valence) electrons 
part away from their atoms and are free to move. These electrons are 
free within the metal but not free to leave the metal. The free electrons 
form a kind of ‘gas’; they collide with each other and with the ions, and 
move randomly in different directions. In an external electric field, they 
drift against the direction of the field. The positive ions made up of the 
nuclei and the bound electrons remain held in their fixed positions. In 
electrolytic conductors, the charge carriers are both positive and negative 
ions; but the situation in this case is more involved - the movement of the
charge carriers is affected both by the external electric field and by
the so-called chemical forces (see Chapter 3). We shall restrict our
discussion to metallic solid conductors. Let us note important results 
regarding electrostatics of conductors. 

1. Inside a conductor, electrostatic field is zero 

Consider a conductor, neutral or charged. There may also be an external 
electrostatic field. In the static situation, when there is no current inside or 
on the surface of the conductor, the electric field is zero everywhere 
inside the conductor. This fact can be taken as the defining property of a 
conductor. A conductor has free electrons. As long as electric field is not 
zero, the free charge carriers would experience force and drift. In the 
static situation, the free charges have so distributed themselves that the 
electric field is zero everywhere inside. Electrostatic field is zero inside a 
conductor. 


2. At the surface of a charged conductor, electrostatic field 
must be normal to the surface at every point 

If E were not normal to the surface, it would have some non-zero 
component along the surface. Free charges on the surface of the 
conductor would then experience force and move. In the static situation, 
therefore, E should have no tangential component. Thus electrostatic 
field at the surface of a charged conductor must be normal to the surface 
at every point. (For a conductor without any surface charge density, field 
is zero even at the surface.) See result 5. 

3. The interior of a conductor can have no excess charge in 
the static situation 

A neutral conductor has equal amounts of positive and negative charges 
in every small volume or surface element. When the conductor is 
charged, the excess charge can reside only on the surface in the static 
situation. This follows from Gauss’s law. Consider any arbitrary 
volume element v inside a conductor. On the closed surface S bounding 
the volume element v, electrostatic field is zero. Thus the total electric 
flux through S is zero. Hence, by Gauss’s law, there is no net charge 
enclosed by S. But the surface S can be made as small as you like, i.e., 
the volume v can be made vanishingly small. This means there is no net 
charge at any point inside the conductor, and any excess charge must 
reside at the surface. 

4. Electrostatic potential is constant throughout the volume 
of the conductor and has the same value (as inside) on its 
surface 

This follows from results 1 and 2 above. Since E = 0 inside the conductor 
and has no tangential component on the surface, no work is done in 
moving a small test charge within the conductor and on its surface. That 
is, there is no potential difference between any two points inside or on the 
surface of the conductor. Hence, the result. If the conductor is charged, 
electric field normal to the surface exists; this means potential will be 
different for the surface and a point just outside the surface. 

In a system of conductors of arbitrary size, shape and 
charge configuration, each conductor is characterised by a constant 
value of potential, but this constant may differ from one conductor to 
the other. 

5. Electric field at the surface of a charged conductor 

$$\mathbf{E} = \frac{\sigma}{\varepsilon_0}\,\hat{\mathbf{n}} \tag{2.35}$$

where $\sigma$ is the surface charge density and $\hat{\mathbf{n}}$ is a unit vector normal to 
the surface in the outward direction. 

To derive the result, choose a pill box (a short cylinder) as the Gaussian 
surface about any point P on the surface, as shown in Fig. 2.17. The pill 
box is partly inside and partly outside the surface of the conductor. It has 
a small area of cross section $\delta S$ and negligible height. 

Just inside the surface, the electrostatic field is zero; just outside, the field 
is normal to the surface with magnitude E. Thus, the contribution to the 
total flux through the pill box comes only from the outside (circular) 
cross-section of the pill box. This equals $\pm E\,\delta S$ (positive for $\sigma > 0$, 
negative for $\sigma < 0$), since over the small area $\delta S$, E may be considered 
constant and E and $\delta S$ are parallel or antiparallel. The charge enclosed by 
the pill box is $\sigma\,\delta S$. 


By Gauss’s law 

$$E\,\delta S = \frac{|\sigma|\,\delta S}{\varepsilon_0}, \qquad E = \frac{|\sigma|}{\varepsilon_0} \tag{2.36}$$

Including the fact that electric field is normal to the surface, we get the 
vector relation, Eq. (2.35), which is true for both signs of $\sigma$. For $\sigma > 0$, 
electric field is normal to the surface outward; for $\sigma < 0$, electric field is 
normal to the surface inward. 

Figure 2.17 The Gaussian surface (a pill box) chosen to derive Eq. (2.35) for electric field at the 
surface of a charged conductor. 
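
As a rough numerical illustration of Eq. (2.35), the short sketch below evaluates the normal field just outside a conductor for a given surface charge density. The function name `surface_field` and the sample charge densities are assumptions made for this illustration only; the value of the permittivity of free space is the standard one.

```python
# Permittivity of free space in C^2 N^-1 m^-2 (standard value).
EPSILON_0 = 8.854e-12


def surface_field(sigma):
    """Signed normal component of E just outside a conductor, Eq. (2.35).

    sigma: surface charge density in C/m^2 (may be positive or negative).
    Returns the field along the outward normal in V/m; a negative value
    means the field points inward, towards the surface.
    """
    return sigma / EPSILON_0


# Assumed sample values: +/- 1 microcoulomb per square metre.
for sigma in (1.0e-6, -1.0e-6):
    print(f"sigma = {sigma:+.1e} C/m^2 -> E_n = {surface_field(sigma):+.3e} V/m")
```

The sign of the result carries the direction stated in the text: outward for positive charge density and inward for negative charge density.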


6. Electrostatic shielding 

Consider a conductor with a cavity, with no charges inside the cavity. A 
remarkable result is that the electric field inside the cavity is zero, 
whatever be the size and shape of the cavity and whatever be the charge 
on the conductor and the external fields in which it might be placed. We 
have proved a simple case of this result already: the electric field inside a 
charged spherical shell is zero. The proof of the result for the shell makes 
use of the spherical symmetry of the shell (see Chapter 1). But the 
vanishing of electric field in the (charge-free) cavity of a conductor is, as 
mentioned above, a very general result. A related result is that even if the 
conductor is charged or charges are induced on a neutral conductor by 
an external field, all charges reside only on the outer surface of a 
conductor with cavity. 

Figure 2.18 The electric field inside a cavity of any conductor is zero. All charges reside only on 
the outer surface of a conductor with cavity. (There are no charges placed in the cavity.) 

Figure 2.19 Some important electrostatic properties of a conductor. 









The proofs of the results noted in Fig. 2.18 are omitted here, but we note 
their important implication. Whatever be the charge and field 
configuration outside, any cavity in a conductor remains shielded from 
outside electric influence: the field inside the cavity is always zero. This is 
known as electrostatic shielding. The effect can be made use of in 
protecting sensitive instruments from outside electrical influence. Figure 
2.19 gives a summary of the important electrostatic properties of a 
conductor. 


Example 2.7 

(a) A comb run through one’s dry hair attracts small bits of paper. 
Why? What happens if the hair is wet or if it is a rainy day? (Remember, 
paper does not conduct electricity.) 

Solution 

(a) This is because the comb gets charged by friction. The molecules 
in the paper get polarised by the charged comb, resulting in a net 
force of attraction. If the hair is wet, or if it is a rainy day, friction 
between the hair and the comb is reduced. The comb does not get charged 
and thus it will not attract small bits of paper. 

(b) To enable them to conduct charge (produced by friction) to the 
ground, since too much accumulated static electricity may cause a 
spark and result in fire. 

(c) Reason similar to (b). 

(d) Current passes only when there is difference in potential. 



2.10 Dielectrics and Polarisation 

Dielectrics are non-conducting substances. In contrast to conductors, 
they have no (or negligible number of) charge carriers. Recall from 
Section 2.9 what happens when a conductor is placed in an external 
electric field. The free charge carriers move and charge distribution in the 
conductor adjusts itself in such a way that the electric field due to induced 
charges opposes the external field within the conductor. This happens 
until, in the static situation, the two fields cancel each other and the net 
electrostatic field in the conductor is zero. In a dielectric, this free 
movement of charges is not possible. It turns out that the external field 
induces dipole moment by stretching or re-orienting molecules of the 
dielectric. The collective effect of all the molecular dipole moments is net 
charges on the surface of the dielectric which produce a field that 
opposes the external field. Unlike in a conductor, however, the opposing 
field so induced does not exactly cancel the external field. It only reduces 
it. The extent of the effect depends on the nature of the dielectric. To 
understand the effect, we need to look at the charge distribution of a 
dielectric at the molecular level. 

Figure 2.20 Difference in behaviour of a conductor and a dielectric in an external electric field. 

The molecules of a substance may be polar or non-polar. In a non-polar 
molecule, the centres of positive and negative charges coincide. The 
molecule then has no permanent (or intrinsic) dipole moment. Examples 
of non-polar molecules are oxygen ($\mathrm{O_2}$) and hydrogen ($\mathrm{H_2}$) molecules 
which, because of their symmetry, have no dipole moment. On the other 
hand, a polar molecule is one in which the centres of positive and 
negative charges are separated (even when there is no external field). 
Such molecules have a permanent dipole moment. An ionic molecule 
such as HCl or a molecule of water ($\mathrm{H_2O}$) are examples of polar 
molecules. 

Figure 2.21 Some examples of polar and non-polar molecules. 


In an external electric field, the positive and negative charges of a 
non-polar molecule are displaced in opposite directions. The displacement 
stops when the external force on the constituent charges of the molecule 
is balanced by the restoring force (due to internal fields in the molecule). 
The non-polar molecule thus develops an induced dipole moment. The 
dielectric is said to be polarised by the external field. We consider only 
the simple situation when the induced dipole moment is in the direction of 
the field and is proportional to the field strength. (Substances for which 
this assumption is true are called linear isotropic dielectrics.) The induced 
dipole moments of different molecules add up giving a net dipole moment 
of the dielectric in the presence of the external field. 

A dielectric with polar molecules also develops a net dipole moment in an 
external field, but for a different reason. In the absence of any external 
field, the different permanent dipoles are oriented randomly due to 
thermal agitation; so the total dipole moment is zero. When an external 
field is applied, the individual dipole moments tend to align with the field. 
When summed over all the molecules, there is then a net dipole moment 
in the direction of the external field, i.e., the dielectric is polarised. The 
extent of polarisation depends on the relative strength of two mutually 
opposite factors: the dipole potential energy in the external field tending 
to align the dipoles with the field and thermal energy tending to disrupt 
the alignment. There may be, in addition, the ‘induced dipole moment’ 
effect as for non-polar molecules, but generally the alignment effect is 
more important for polar molecules. 

Figure 2.22 A dielectric develops a net dipole moment in an external electric field. (a) Non-polar 
molecules, (b) Polar molecules. 


Thus in either case, whether polar or non-polar, a dielectric develops a 
net dipole moment in the presence of an external field. The dipole 
moment per unit volume is called polarisation and is denoted by P. For 
linear isotropic dielectrics, 

$$\mathbf{P} = \chi_e\,\mathbf{E} \tag{2.37}$$

where $\chi_e$ is a constant characteristic of the dielectric and is known as the 
electric susceptibility of the dielectric medium. 

It is possible to relate $\chi_e$ to the molecular properties of the substance, but 
we shall not pursue that here. 

The question is: how does the polarised dielectric modify the original 
external field inside it? Let us consider, for simplicity, a rectangular 
dielectric slab placed in a uniform external field $\mathbf{E}_0$ parallel to two of its 
faces. The field causes a uniform polarisation P of the dielectric. Thus 
every volume element $\Delta v$ of the slab has a dipole moment $\mathbf{P}\,\Delta v$ in the 
direction of the field. The volume element $\Delta v$ is macroscopically small but 
contains a very large number of molecular dipoles. Anywhere inside the 
dielectric, the volume element $\Delta v$ has no net charge (though it has net 
dipole moment). This is because the positive charge of one dipole sits 
close to the negative charge of the adjacent dipole. However, at the 
surfaces of the dielectric normal to the electric field, there is evidently a 
net charge density. As seen in Fig. 2.23, the positive ends of the dipoles 
remain unneutralised at the right surface and the negative ends at the left 
surface. The unbalanced charges are the induced charges due to the 
external field. 

Thus, the polarised dielectric is equivalent to two charged surfaces with 
induced surface charge densities, say $\sigma_p$ and $-\sigma_p$. Clearly, the field 
produced by these surface charges opposes the external field. The total 
field in the dielectric is, thereby, reduced from the case when no dielectric 
is present. We should note that the surface charge density $\pm\sigma_p$ arises 
from bound (not free) charges in the dielectric. 

2.11 Capacitors and Capacitance 

A capacitor is a system of two conductors separated by an insulator (Fig. 
2.24). The conductors have charges, say $Q_1$ and $Q_2$, and potentials $V_1$ 
and $V_2$. Usually, in practice, the two conductors have charges $Q$ 
and $-Q$, with potential difference $V = V_1 - V_2$ between them. We shall 
consider only this kind of charge configuration of the capacitor. (Even a 
single conductor can be used as a capacitor by assuming the other at 
infinity.) The conductors may be so charged by connecting them to the 
two terminals of a battery. Q is called the charge of the capacitor, though 
this, in fact, is the charge on one of the conductors - the total charge of 
the capacitor is zero. 

Figure 2.23 A uniformly polarised dielectric amounts to induced surface charge density, but no 
volume charge density. 

The electric field in the region between the conductors is proportional to 
the charge Q. That is, if the charge on the capacitor is, say doubled, the 
electric field will also be doubled at every point. (This follows from the 
direct proportionality between field and charge implied by Coulomb’s law 
and the superposition principle.) Now, potential difference V is the work 
done per unit positive charge in taking a small test charge from the 
conductor 2 to 1 against the field. Consequently, V is also proportional to 
Q, and the ratio Q/V is a constant: 

$$C = \frac{Q}{V} \tag{2.38}$$

The constant C is called the capacitance of the capacitor. C is 
independent of Q or V, as stated above. The capacitance C depends only 
on the geometrical configuration (shape, size, separation) of the system 
of two conductors. [As we shall see later, it also depends on the nature of 
the insulator (dielectric) separating the two conductors.] The SI unit of 
capacitance is 1 farad (= 1 coulomb volt$^{-1}$) or 1 F = 1 C V$^{-1}$. A capacitor 
with fixed capacitance is symbolically shown as two short parallel lines 
(-||-), while the one with variable capacitance is shown by the same 
symbol crossed by an arrow. 



Figure 2.24 A system of two conductors separated by an insulator forms a capacitor. 

Equation (2.38) shows that for large C, V is small for a given Q. This 
means a capacitor with large capacitance can hold large amount of 
charge Q at a relatively small V. This is of practical importance. High 
potential difference implies strong electric field around the conductors. A 
strong electric field can ionise the surrounding air and accelerate the 
charges so produced to the oppositely charged plates, thereby 
neutralising the charge on the capacitor plates, at least partly. In other 
words, the charge of the capacitor leaks away due to the reduction in 
insulating power of the intervening medium. 

The maximum electric field that a dielectric medium can withstand 
without break-down (of its insulating property) is called its dielectric 
strength; for air it is about $3 \times 10^{6}$ V m$^{-1}$. For a separation between 
conductors of the order of 1 cm or so, this field corresponds to a potential 
difference of $3 \times 10^{4}$ V between the conductors. Thus, for a capacitor to 
store a large amount of charge without leaking, its capacitance should be 
high enough so that the potential difference and hence the electric field 
do not exceed the break-down limits. Put differently, there is a limit to the 
amount of charge that can be stored on a given capacitor without 
significant leaking. In practice, a farad is a very big unit; the most 
common units are its sub-multiples 1 μF = $10^{-6}$ F, 1 nF = $10^{-9}$ F, 
1 pF = $10^{-12}$ F, etc. Besides its use in storing charge, a capacitor is a key 
element of most ac circuits with important functions, as described in 
Chapter 7. 
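
The break-down limit described above can be put into numbers with a small sketch. It assumes a uniform field between the conductors, so that the largest safe potential difference is simply the dielectric strength times the separation; the function names and the 1 nF sample capacitance are hypothetical choices for illustration.

```python
# Dielectric strength of air quoted in the text, in V/m.
AIR_DIELECTRIC_STRENGTH = 3e6


def max_voltage(separation_m, strength=AIR_DIELECTRIC_STRENGTH):
    """Largest potential difference before break-down.

    Assumes a uniform field between the conductors, so V_max = E_max * d.
    """
    return strength * separation_m


def max_charge(capacitance_f, separation_m, strength=AIR_DIELECTRIC_STRENGTH):
    """Largest charge the capacitor can hold without break-down, Q = C * V_max."""
    return capacitance_f * max_voltage(separation_m, strength)


# A 1 cm air gap, as in the text, with an assumed capacitance of 1 nF.
print(f"V_max = {max_voltage(0.01):.1e} V")
print(f"Q_max = {max_charge(1e-9, 0.01):.1e} C")
```

For a fixed separation, the charge limit grows in proportion to the capacitance, which is the point made in the text.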

2.12 The Parallel Plate Capacitor 

A parallel plate capacitor consists of two large plane parallel conducting 
plates separated by a small distance (Fig. 2.25). We first take the 
intervening medium between the plates to be vacuum. The effect of a 
dielectric medium between the plates is discussed in the next section. Let 
A be the area of each plate and d the separation between them. The two 
plates have charges Q and -Q. Since d is much smaller than the linear 
dimension of the plates ($d^2 \ll A$), we can use the result on electric field 
due to an infinite plane sheet of uniform surface charge density (Section 
1.15). Plate 1 has surface charge density $\sigma = Q/A$ and plate 2 has a 
surface charge density $-\sigma$. Using Eq. (1.33), the electric field in different 
regions is: 

Outer region I (region above the plate 1), 

$$E = \frac{\sigma}{2\varepsilon_0} - \frac{\sigma}{2\varepsilon_0} = 0 \tag{2.39}$$

Outer region II (region below the plate 2), 

$$E = \frac{\sigma}{2\varepsilon_0} - \frac{\sigma}{2\varepsilon_0} = 0 \tag{2.40}$$

In the inner region between the plates 1 and 2, the electric fields due to 
the two charged plates add up, giving 

$$E = \frac{\sigma}{2\varepsilon_0} + \frac{\sigma}{2\varepsilon_0} = \frac{\sigma}{\varepsilon_0} = \frac{Q}{\varepsilon_0 A} \tag{2.41}$$

The direction of electric field is from the positive to the negative plate. 

Figure 2.25 The parallel plate capacitor. 


Thus, the electric field is localised between the two plates and is uniform 
throughout. For plates with finite area, this will not be true near the outer 
boundaries of the plates. The field lines bend outward at the edges, an 
effect called ‘fringing of the field’. By the same token, $\sigma$ will not be strictly 
uniform on the entire plate. [E and $\sigma$ are related by Eq. (2.35).] However, 
for $d^2 \ll A$, these effects can be ignored in the regions sufficiently far 
from the edges, and the field there is given by Eq. (2.41). Now for uniform 
electric field, potential difference is simply the electric field times the 
distance between the plates, that is, 

$$V = E\,d = \frac{1}{\varepsilon_0}\,\frac{Q\,d}{A} \tag{2.42}$$


The capacitance C of the parallel plate capacitor is then 

$$C = \frac{Q}{V} = \frac{\varepsilon_0 A}{d} \tag{2.43}$$


which, as expected, depends only on the geometry of the system. For 
typical values like A = 1 m$^2$, d = 1 mm, we get 

$$C = \frac{8.85 \times 10^{-12}\ \mathrm{C^2\,N^{-1}\,m^{-2}} \times 1\ \mathrm{m^2}}{10^{-3}\ \mathrm{m}} = 8.85 \times 10^{-9}\ \mathrm{F} \tag{2.44}$$

(You can check that 1 F = 1 C V$^{-1}$ = 1 C (N C$^{-1}$ m)$^{-1}$ = 1 C$^2$ N$^{-1}$ m$^{-1}$.) 

This shows that 1 F is too big a unit in practice, as remarked earlier. 
Another way of seeing the ‘bigness’ of 1 F is to calculate the area of the 
plates needed to have C = 1 F for a separation of, say 1 cm: 

$$A = \frac{C\,d}{\varepsilon_0} = \frac{1\ \mathrm{F} \times 10^{-2}\ \mathrm{m}}{8.85 \times 10^{-12}\ \mathrm{C^2\,N^{-1}\,m^{-2}}} \approx 10^{9}\ \mathrm{m^2} \tag{2.45}$$

which is a plate about 30 km in length and breadth! 
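
The two estimates above, Eqs. (2.44) and (2.45), can be reproduced with a few lines of code. This is only a sketch for the vacuum-filled, fringing-free case treated in this section; the function names are chosen for illustration.

```python
import math

# Permittivity of free space as used in the text, in C^2 N^-1 m^-2.
EPSILON_0 = 8.85e-12


def parallel_plate_capacitance(area_m2, separation_m):
    """Capacitance of an ideal vacuum-filled parallel plate capacitor, Eq. (2.43)."""
    return EPSILON_0 * area_m2 / separation_m


def plate_area_for(capacitance_f, separation_m):
    """Plate area needed for a given capacitance, Eq. (2.43) solved for A."""
    return capacitance_f * separation_m / EPSILON_0


# Eq. (2.44): A = 1 m^2, d = 1 mm.
c = parallel_plate_capacitance(1.0, 1e-3)
print(f"C = {c:.2e} F")

# Eq. (2.45): area for C = 1 F at d = 1 cm, and the side of a square plate.
area = plate_area_for(1.0, 1e-2)
side_km = math.sqrt(area) / 1000.0
print(f"A = {area:.2e} m^2, side of square plate = {side_km:.1f} km")
```

The side of the square plate comes out a little over 30 km, consistent with the rounded figure quoted in the text.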








2.13 Effect of Dielectric on Capacitance 

With the understanding of the behaviour of dielectrics in an external field 
developed in Section 2.10, let us see how the capacitance of a parallel 
plate capacitor is modified when a dielectric is present. As before, we 
have two large plates, each of area A, separated by a distance d. The 
charge on the plates is $\pm Q$, corresponding to the charge density $\pm\sigma$ (with 
$\sigma = Q/A$). When there is vacuum between the plates, 

$$E_0 = \frac{\sigma}{\varepsilon_0}$$

and the potential difference $V_0$ is 

$$V_0 = E_0\,d$$

The capacitance $C_0$ in this case is 

$$C_0 = \frac{Q}{V_0} = \frac{\varepsilon_0 A}{d} \tag{2.46}$$

Consider next a dielectric inserted between the plates fully occupying the 
intervening region. The dielectric is polarised by the field and, as 
explained in Section 2.10, the effect is equivalent to two charged sheets 
(at the surfaces of the dielectric normal to the field) with surface charge 
densities $\sigma_p$ and $-\sigma_p$. The electric field in the dielectric then corresponds 
to the case when the net surface charge density on the plates is 
$\pm(\sigma - \sigma_p)$. 

That is, 

$$E = \frac{\sigma - \sigma_p}{\varepsilon_0} \tag{2.47}$$

so that the potential difference across the plates is 

$$V = E\,d = \frac{\sigma - \sigma_p}{\varepsilon_0}\,d \tag{2.48}$$


For linear dielectrics, we expect $\sigma_p$ to be proportional to $E_0$, i.e., to $\sigma$. 
Thus, $(\sigma - \sigma_p)$ is proportional to $\sigma$ and we can write 

$$\sigma - \sigma_p = \frac{\sigma}{K} \tag{2.49}$$

where K is a constant characteristic of the dielectric. Clearly, K > 1. We 
then have 

$$V = \frac{\sigma d}{\varepsilon_0 K} = \frac{Q\,d}{A\,\varepsilon_0 K} \tag{2.50}$$

The capacitance C, with dielectric between the plates, is then 

$$C = \frac{Q}{V} = \frac{\varepsilon_0 K A}{d} \tag{2.51}$$

The product $\varepsilon_0 K$ is called the permittivity of the medium and is 
denoted by $\varepsilon$ 

$$\varepsilon = \varepsilon_0 K \tag{2.52}$$

For vacuum K = 1 and $\varepsilon = \varepsilon_0$; $\varepsilon_0$ is called the permittivity of the vacuum. 
The dimensionless ratio 

$$K = \frac{\varepsilon}{\varepsilon_0} \tag{2.53}$$

is called the dielectric constant of the substance. As remarked before, 
from Eq. (2.49), it is clear that K is greater than 1. From Eqs. (2.46) and 
(2.51) 

$$K = \frac{C}{C_0} \tag{2.54}$$


Thus, the dielectric constant of a substance is the factor (>1) by which 
the capacitance increases from its vacuum value, when the dielectric is 
inserted fully between the plates of a capacitor. Though we arrived at 
Eq. (2.54) for the case of a parallel plate capacitor, it holds good for any 
type of capacitor and can, in fact, be viewed in general as a definition of 
the dielectric constant of a substance. 


ELECTRIC DISPLACEMENT 


We have introduced the notion of dielectric constant and arrived at 
Eq. (2.54), without giving the explicit relation between the induced 
charge density $\sigma_p$ and the polarisation P. 

We take without proof the result that 

$$\sigma_p = \mathbf{P} \cdot \hat{\mathbf{n}}$$

where $\hat{\mathbf{n}}$ is a unit vector along the outward normal to the surface. 
The above equation is general, true for any shape of the dielectric. For 
the slab in Fig. 2.23, P is along $\hat{\mathbf{n}}$ at the right surface and opposite 
to $\hat{\mathbf{n}}$ at the left surface. Thus at the right surface, induced charge 
density is positive and at the left surface, it is negative, as guessed 
already in our qualitative discussion before. Putting the equation for 
electric field in vector form 

$$\mathbf{E} \cdot \hat{\mathbf{n}} = \frac{\sigma - \mathbf{P} \cdot \hat{\mathbf{n}}}{\varepsilon_0}$$

or 

$$(\varepsilon_0 \mathbf{E} + \mathbf{P}) \cdot \hat{\mathbf{n}} = \sigma$$

The quantity $\varepsilon_0 \mathbf{E} + \mathbf{P}$ is called the electric displacement and is 
denoted by D. It is a vector quantity. Thus, 

$$\mathbf{D} = \varepsilon_0 \mathbf{E} + \mathbf{P}, \qquad \mathbf{D} \cdot \hat{\mathbf{n}} = \sigma$$

The significance of D is this: in vacuum, E is related to the free 
charge density $\sigma$. When a dielectric medium is present, the 
corresponding role is taken up by D. For a dielectric medium, it 
is D, not E, that is directly related to free charge density $\sigma$, as seen in 
the above equation. Since P is in the same direction as E, all the three 
vectors P, E and D are parallel. 

The ratio of the magnitudes of D and E is 

$$\frac{D}{E} = \frac{\sigma\,\varepsilon_0}{\sigma - \sigma_p} = \varepsilon_0 K$$

Thus, 

$$\mathbf{D} = \varepsilon_0 K\,\mathbf{E}$$

and 

$$\mathbf{P} = \mathbf{D} - \varepsilon_0 \mathbf{E} = \varepsilon_0 (K - 1)\,\mathbf{E}$$

This gives for the electric susceptibility $\chi_e$ defined in Eq. (2.37) 

$$\chi_e = \varepsilon_0 (K - 1)$$


Example 2.8 A slab of material of dielectric constant K has the same 
area as the plates of a parallel-plate capacitor but has a thickness 
(3/4)d, where d is the separation of the plates. How is the 
capacitance changed when the slab is inserted between the plates? 

Solution Let $E_0 = V_0/d$ be the electric field between the plates when 
there is no dielectric and the potential difference is $V_0$. If the dielectric 
is now inserted, the electric field in the dielectric will be $E = E_0/K$, 
while the field in the remaining gap of thickness d/4 stays $E_0$. The 
potential difference will then be 

$$V = E_0\left(\frac{1}{4}d\right) + \frac{E_0}{K}\left(\frac{3}{4}d\right) = E_0\,d\left(\frac{1}{4} + \frac{3}{4K}\right) = V_0\,\frac{K + 3}{4K}$$

The potential difference decreases by the factor (K + 3)/4K while the 
free charge $Q_0$ on the plates remains unchanged. The capacitance 
thus increases 

$$C = \frac{Q_0}{V} = \frac{4K}{K + 3}\,\frac{Q_0}{V_0} = \frac{4K}{K + 3}\,C_0$$
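
The reasoning of Example 2.8 generalises to a slab filling any fraction of the gap, since the potential difference is the sum of the drops across the empty part and the dielectric part. The sketch below assumes the same idealisations as the example (slab area equal to plate area, uniform fields); the function name and the sample value K = 3 are hypothetical.

```python
def capacitance_ratio_with_slab(K, fraction):
    """Ratio C / C0 for a slab of dielectric constant K that fills the
    given fraction (0 to 1) of the plate separation.

    The field is E0 in the empty part and E0 / K inside the slab, so
    V / V0 = (1 - fraction) + fraction / K, and C / C0 is the inverse.
    """
    if K < 1:
        raise ValueError("dielectric constant K must be at least 1")
    if not 0.0 <= fraction <= 1.0:
        raise ValueError("fraction must lie between 0 and 1")
    return 1.0 / ((1.0 - fraction) + fraction / K)


# Example 2.8: slab thickness (3/4) d, for which C / C0 = 4K / (K + 3).
K = 3.0  # assumed sample value
print(capacitance_ratio_with_slab(K, 0.75), 4 * K / (K + 3))
```

Setting the fraction to 1 recovers C = K C0 of Eq. (2.54), and setting it to 0 recovers the vacuum value.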


2.14 Combination of Capacitors 

We can combine several capacitors of capacitance $C_1, C_2, \ldots, C_n$ to 
obtain a system with some effective capacitance C. The effective 
capacitance depends on the way the individual capacitors are combined. 
Two simple possibilities are discussed below. 

2.14.1 Capacitors in Series 

Figure 2.26 shows capacitors $C_1$ and $C_2$ combined in series. 

The left plate of $C_1$ and the right plate of $C_2$ are connected to two 
terminals of a battery and have charges $Q$ and $-Q$, respectively. It then 
follows that the right plate of $C_1$ has charge $-Q$ and the left plate of $C_2$ 
has charge $Q$. If this was not so, the net charge on each capacitor would 
not be zero. This would result in an electric field in the conductor 
connecting $C_1$ and $C_2$. Charge would flow until the net charge on both $C_1$ 
and $C_2$ is zero and there is no electric field in the conductor connecting 
$C_1$ and $C_2$. Thus, in the series combination, charges on the two plates 
($\pm Q$) are the same on each capacitor. The total potential drop V across 
the combination is the sum of the potential drops $V_1$ and $V_2$ across $C_1$ 
and $C_2$, respectively. 

Figure 2.26 Combination of two capacitors in series. 

Figure 2.27 Combination of n capacitors in series. 






















$$V = V_1 + V_2 = \frac{Q}{C_1} + \frac{Q}{C_2} \tag{2.55}$$

i.e., 

$$\frac{V}{Q} = \frac{1}{C_1} + \frac{1}{C_2} \tag{2.56}$$


Now we can regard the combination as an effective capacitor with charge 
Q and potential difference V. The effective capacitance of the 
combination is 

$$C = \frac{Q}{V} \tag{2.57}$$

We compare Eq. (2.57) with Eq. (2.56), and obtain 

$$\frac{1}{C} = \frac{1}{C_1} + \frac{1}{C_2} \tag{2.58}$$


The proof clearly goes through for any number of capacitors arranged in 
a similar way. Equation (2.55), for n capacitors arranged in series, 
generalises to 

$$V = V_1 + V_2 + \ldots + V_n = \frac{Q}{C_1} + \frac{Q}{C_2} + \ldots + \frac{Q}{C_n} \tag{2.59}$$

Following the same steps as for the case of two capacitors, we get the 
general formula for effective capacitance of a series combination of n 
capacitors: 

$$\frac{1}{C} = \frac{1}{C_1} + \frac{1}{C_2} + \frac{1}{C_3} + \ldots + \frac{1}{C_n} \tag{2.60}$$
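
Equation (2.60) translates directly into a short function. The sketch below is one possible way to write it; the function name `series_capacitance` and the sample values are illustrative assumptions.

```python
def series_capacitance(capacitances):
    """Effective capacitance of capacitors connected in series, Eq. (2.60).

    capacitances: iterable of positive capacitance values, all in the
    same unit. The result is returned in that unit.
    """
    caps = list(capacitances)
    if not caps or any(c <= 0 for c in caps):
        raise ValueError("need at least one capacitor, all with C > 0")
    return 1.0 / sum(1.0 / c for c in caps)


# Assumed sample: three 10 microfarad capacitors in series.
c_eff = series_capacitance([10e-6, 10e-6, 10e-6])
print(f"C = {c_eff * 1e6:.2f} microfarad")
```

The effective capacitance of a series combination is always smaller than the smallest individual capacitance, because each added capacitor adds a positive term to 1/C.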


2.14.2 Capacitors in Parallel 

Figure 2.28 (a) shows two capacitors arranged in parallel. In this case, 
the same potential difference is applied across both the capacitors. But 
the plate charges ($\pm Q_1$) on capacitor 1 and the plate charges ($\pm Q_2$) on 
the capacitor 2 are not necessarily the same: 

$$Q_1 = C_1 V, \qquad Q_2 = C_2 V \tag{2.61}$$

The equivalent capacitor is one with charge 

$$Q = Q_1 + Q_2 \tag{2.62}$$

and potential difference V. 

$$Q = CV = C_1 V + C_2 V \tag{2.63}$$

The effective capacitance C is, from Eq. (2.63), 

$$C = C_1 + C_2 \tag{2.64}$$

The general formula for effective capacitance C for parallel combination 
of n capacitors [Fig. 2.28 (b)] follows similarly, 

$$Q = Q_1 + Q_2 + \ldots + Q_n \tag{2.65}$$

i.e., 

$$CV = C_1 V + C_2 V + \ldots + C_n V \tag{2.66}$$

which gives 

$$C = C_1 + C_2 + \ldots + C_n \tag{2.67}$$

Figure 2.28 Parallel combination of (a) two capacitors, (b) n capacitors. 


Example 2.9 A network of four 10 μF capacitors is connected to a 
500 V supply, as shown in Fig. 2.29. Determine (a) the equivalent 
capacitance of the network and (b) the charge on each capacitor. 
(Note, the charge on a capacitor is the charge on the plate with 
higher potential, equal and opposite to the charge on the plate with 
lower potential.) 

FIGURE 2.29 

Solution 

(a) In the given network, $C_1$, $C_2$ and $C_3$ are connected in series. The 
effective capacitance C' of these three capacitors is given by 

$$\frac{1}{C'} = \frac{1}{C_1} + \frac{1}{C_2} + \frac{1}{C_3}$$

For $C_1 = C_2 = C_3$ = 10 μF, C' = (10/3) μF. The network has C' and $C_4$ 
connected in parallel. Thus, the equivalent capacitance C of the 
network is 

$$C = C' + C_4 = \left(\frac{10}{3} + 10\right) \mu\mathrm{F} = 13.3\ \mu\mathrm{F}$$

(b) Clearly, from the figure, the charge on each of the capacitors 
$C_1$, $C_2$ and $C_3$ is the same, say Q. Let the charge on $C_4$ be Q'. Now, since 
the potential difference across AB is $Q/C_1$, across BC is $Q/C_2$, across 
CD is $Q/C_3$, we have 

$$\frac{Q}{C_1} + \frac{Q}{C_2} + \frac{Q}{C_3} = 500\ \mathrm{V}$$

Also, $Q'/C_4$ = 500 V. 

This gives for the given value of the capacitances, 

$$Q = 500\ \mathrm{V} \times \frac{10}{3}\ \mu\mathrm{F} = 1.7 \times 10^{-3}\ \mathrm{C}$$

and 

$$Q' = 500\ \mathrm{V} \times 10\ \mu\mathrm{F} = 5.0 \times 10^{-3}\ \mathrm{C}$$
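
The numbers in Example 2.9 can be checked with a short script that applies the series and parallel rules in turn. The helper names `series` and `parallel` are illustrative; the network layout (three capacitors in series, with the fourth across the supply) is taken from the solution above.

```python
def series(*caps):
    """Effective capacitance of capacitors in series, Eq. (2.60)."""
    return 1.0 / sum(1.0 / c for c in caps)


def parallel(*caps):
    """Effective capacitance of capacitors in parallel, Eq. (2.67)."""
    return sum(caps)


C1 = C2 = C3 = C4 = 10e-6   # farad
V = 500.0                   # volt

c_branch = series(C1, C2, C3)        # C' of the solution
c_total = parallel(c_branch, C4)     # equivalent capacitance of the network
q_series = c_branch * V              # charge Q on each of C1, C2, C3
q_c4 = C4 * V                        # charge Q' on C4

print(f"C' = {c_branch * 1e6:.2f} uF, C = {c_total * 1e6:.1f} uF")
print(f"Q = {q_series:.2e} C, Q' = {q_c4:.2e} C")
```

The charge on the series branch is found from the branch capacitance because the full 500 V appears across that branch as a whole.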

2.15 Energy Stored in a Capacitor 

A capacitor, as we have seen above, is a system of two conductors with 
charge Q and -Q. To determine the energy stored in this configuration, 
consider initially two uncharged conductors 1 and 2. Imagine next a 
process of transferring charge from conductor 2 to conductor 1 bit by bit, 
so that at the end, conductor 1 gets charge Q. By charge conservation, 
conductor 2 has charge -Q at the end (Fig. 2.30). 

In transferring positive charge from conductor 2 to conductor 1, work will 
be done externally, since at any stage conductor 1 is at a higher potential 
than conductor 2. To calculate the total work done, we first calculate the 
work done in a small step involving transfer of an infinitesimal (i.e., 
vanishingly small) amount of charge. Consider the intermediate situation 
when the conductors 1 and 2 have charges $Q'$ and $-Q'$ respectively. At 
this stage, the potential difference $V'$ between conductors 1 and 2 is 
$Q'/C$, where C is the capacitance of the system. Next imagine that a small 
charge $\delta Q'$ is transferred from conductor 2 to 1. Work done in this step 
($\delta W$), resulting in charge $Q'$ on conductor 1 increasing to $Q' + \delta Q'$, 
is given by 

$$\delta W = V'\,\delta Q' = \frac{Q'}{C}\,\delta Q' \tag{2.68}$$

Since $\delta Q'$ can be made as small as we like, Eq. (2.68) can be written as 

$$\delta W = \frac{1}{2C}\left[(Q' + \delta Q')^2 - Q'^2\right] \tag{2.69}$$

Figure 2.30 (a) Work done in a small step of building charge on conductor 1 from Q' to Q' + δQ'. 
(b) Total work done in charging the capacitor may be viewed as stored in the energy of electric 
field between the plates. 

Equations (2.68) and (2.69) are identical because the term of second 
order in $\delta Q'$, i.e., $\delta Q'^2/2C$, is negligible, since $\delta Q'$ is arbitrarily small. 
The total work done (W) is the sum of the small work ($\delta W$) over the very 
large number of steps involved in building the charge $Q'$ from zero to Q. 

$$W = \sum_{\text{sum over all steps}} \delta W = \sum_{\text{sum over all steps}} \frac{1}{2C}\left[(Q' + \delta Q')^2 - Q'^2\right] \tag{2.70}$$

$$= \frac{1}{2C}\left[\{\delta Q'^2 - 0\} + \{(2\delta Q')^2 - \delta Q'^2\} + \{(3\delta Q')^2 - (2\delta Q')^2\} + \ldots + \{Q^2 - (Q - \delta Q)^2\}\right] \tag{2.71}$$

$$= \frac{1}{2C}\left[Q^2 - 0\right] = \frac{Q^2}{2C} \tag{2.72}$$


The same result can be obtained directly from Eq. (2.68) by integration 

$$W = \int_0^Q \frac{Q'}{C}\,\delta Q' = \frac{1}{C}\,\frac{Q'^2}{2}\Bigg|_0^Q = \frac{Q^2}{2C}$$

This is not surprising since integration is nothing but summation of a 
large number of small terms. 
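
The statement that the step-by-step sum approaches the integral can be seen numerically. The sketch below adds up the work of Eq. (2.68) over a finite number of equal charge steps and compares it with $Q^2/2C$; the function name and the sample values of C and Q are assumptions for illustration.

```python
def work_by_steps(capacitance, total_charge, steps):
    """Sum the work V' * dQ' of Eq. (2.68) over equal charge steps.

    At each step the potential difference is taken as Q'/C for the
    charge Q' already present, so the sum slightly underestimates
    the exact value Q^2 / (2C) for any finite number of steps.
    """
    dq = total_charge / steps
    work = 0.0
    q = 0.0
    for _ in range(steps):
        work += (q / capacitance) * dq
        q += dq
    return work


C = 900e-12   # farad (assumed sample value)
Q = 9e-8      # coulomb (assumed sample value)
exact = Q ** 2 / (2 * C)

for n in (10, 1000, 100000):
    print(f"{n:>6} steps: W = {work_by_steps(C, Q, n):.6e} J (exact {exact:.6e} J)")
```

The shortfall shrinks in proportion to 1/n as the number of steps n grows, which is the second-order term in $\delta Q'$ that the text neglects.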


We can write the final result, Eq. (2.72), in different ways 

$$W = \frac{Q^2}{2C} = \frac{1}{2}CV^2 = \frac{1}{2}QV \tag{2.73}$$


Since electrostatic force is conservative, this work is stored in the form of 
potential energy of the system. For the same reason, the final result for 
potential energy [Eq. (2.73)] is independent of the manner in which the 
charge configuration of the capacitor is built up. When the capacitor 
discharges, this stored-up energy is released. It is possible to view the 
potential energy of the capacitor as ‘stored’ in the electric field between 
the plates. To see this, consider for simplicity, a parallel plate capacitor [of 
area A (of each plate) and separation d between the plates]. 


Energy stored in the capacitor 

$$= \frac{1}{2}\,\frac{Q^2}{C} = \frac{(A\sigma)^2}{2} \times \frac{d}{\varepsilon_0 A} \tag{2.74}$$


The surface charge density $\sigma$ is related to the electric field E between the 
plates, 

$$E = \frac{\sigma}{\varepsilon_0} \tag{2.75}$$

From Eqs. (2.74) and (2.75), we get 
Energy stored in the capacitor 

$$U = \frac{1}{2}\,\varepsilon_0 E^2 \times A\,d \tag{2.76}$$

Note that Ad is the volume of the region between the plates (where 
electric field alone exists). If we define energy density as energy stored 
per unit volume of space, Eq. (2.76) shows that 

Energy density of electric field, 

$$u = \frac{1}{2}\,\varepsilon_0 E^2 \tag{2.77}$$

Though we derived Eq. (2.77) for the case of a parallel plate capacitor, 
the result on energy density of an electric field is, in fact, very general 
and holds true for electric field due to any configuration of charges. 
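
For the parallel plate case, the equivalence of the two viewpoints (energy of the capacitor, Eq. (2.73), and energy density of the field times volume, Eq. (2.76)) can be checked numerically. The sketch below assumes an ideal vacuum-filled capacitor without fringing; the function names and sample dimensions are illustrative.

```python
# Permittivity of free space as used in the text, in C^2 N^-1 m^-2.
EPSILON_0 = 8.85e-12


def energy_from_capacitance(area_m2, separation_m, voltage):
    """Stored energy (1/2) C V^2 with C = epsilon_0 A / d."""
    capacitance = EPSILON_0 * area_m2 / separation_m
    return 0.5 * capacitance * voltage ** 2


def energy_from_field(area_m2, separation_m, voltage):
    """Stored energy as energy density (1/2) epsilon_0 E^2 times volume A d."""
    field = voltage / separation_m              # uniform field E = V / d
    density = 0.5 * EPSILON_0 * field ** 2      # Eq. (2.77)
    return density * area_m2 * separation_m     # Eq. (2.76)


# Assumed sample: A = 1 m^2, d = 1 mm, V = 100 V.
u_cap = energy_from_capacitance(1.0, 1e-3, 100.0)
u_field = energy_from_field(1.0, 1e-3, 100.0)
print(f"(1/2) C V^2 = {u_cap:.4e} J, u * A * d = {u_field:.4e} J")
```

Both routes give the same energy, as expected, since Eq. (2.76) was obtained from Eq. (2.74) by substitution.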


Example 2.10 (a) A 900 pF capacitor is charged by 100 V battery 
[Fig. 2.31(a)]. How much electrostatic energy is stored by the 
capacitor? (b) The capacitor is disconnected from the battery and 
connected to another 900 pF capacitor [Fig. 2.31(b)]. What is the 
electrostatic energy stored by the system? 
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Figure 2.31 


(a) The charge on the capacitor is 

Q = CV = 900 x 10- 12 F x 100 V = 9 x 10“ 8 C 
The energy stored by the capacitor is 
= (1/2) CV 2 = (1/2) QV 
= (1/2) x 9 x 10- 8 C x 100 V = 4.5 x 10- 6 J 

(b) In the steady situation, the two capacitors have their positive 
plates at the same potential, and their negative plates at the same 
potential. Let the common potential difference be V'. The charge on 
each capacitor is then Q' = CV'. By charge conservation, Q' = Q/2. 
This implies V' = V/2. The total energy of the system is 

$= 2 \times \frac{1}{2} Q'V' = \frac{1}{4} QV = 2.25 \times 10^{-6}\,\mathrm{J}$

















Thus, in going from (a) to (b), though no charge is lost, the final
energy is only half the initial energy. Where has the remaining energy
gone?

There is a transient period before the system settles to the 
situation (b). During this period, a transient current flows from the first 
capacitor to the second. Energy is lost during this time in the form of 
heat and electromagnetic radiation. 
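
The arithmetic of Example 2.10 can be reproduced with a few lines of Python. This is only a sketch: the helper names `energy` and `share_charge` are hypothetical, and the second capacitor is assumed to be initially uncharged, as the solution above implies.

```python
import math


def energy(capacitance, voltage):
    """Energy stored in a capacitor, U = (1/2) C V^2."""
    return 0.5 * capacitance * voltage ** 2


def share_charge(c1, v1, c2):
    """Common voltage after a charged capacitor c1 (at v1) is
    connected to an uncharged capacitor c2; total charge is conserved."""
    total_charge = c1 * v1
    return total_charge / (c1 + c2)


C = 900e-12  # farad
V = 100.0    # volt

initial = energy(C, V)                            # case (a)
v_common = share_charge(C, V, C)                  # V' = V/2
final = energy(C, v_common) + energy(C, v_common)  # case (b)
lost = initial - final

print(initial, final, lost)
```

The script gives 4.5 × 10⁻⁶ J initially and 2.25 × 10⁻⁶ J finally, so half the energy is dissipated during the transient.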


2.16 Van de Graaff Generator 


This is a machine that can build up high voltages of the order of a few 
million volts. The resulting large electric fields are used to accelerate 
charged particles (electrons, protons, ions) to high energies needed for 
experiments to probe the small scale structure of matter. The principle 
underlying the machine is as follows. 

Suppose we have a large spherical conducting shell of radius R, on 
which we place a charge Q. This charge spreads itself uniformly all over 
the sphere. As we have seen in Section 1.14, the field outside the sphere 
is just that of a point charge Q at the centre; while the field inside the 
sphere vanishes. So the potential outside is that of a point charge; and 
inside it is constant, namely the value at the radius R. We thus have: 

Potential inside conducting spherical shell of radius R carrying charge Q

= constant

$= \frac{1}{4\pi\varepsilon_0}\frac{Q}{R}$ (2.78)


Now, as shown in Fig. 2.32, let us suppose that in some way we 
introduce a small sphere of radius r, carrying some charge q, into the 




large one, and place it at the centre. The potential due to this new charge 
clearly has the following values at the radii indicated: 

Potential due to small sphere of radius r carrying charge q

$= \frac{1}{4\pi\varepsilon_0}\frac{q}{r}$ at surface of small sphere

$= \frac{1}{4\pi\varepsilon_0}\frac{q}{R}$ at large shell of radius R. (2.79)

Taking both charges q and Q into account we have for the total potential
V and the potential difference the values

$V(R) = \frac{1}{4\pi\varepsilon_0}\left(\frac{Q}{R} + \frac{q}{R}\right)$

$V(r) = \frac{1}{4\pi\varepsilon_0}\left(\frac{Q}{R} + \frac{q}{r}\right)$

$V(r) - V(R) = \frac{q}{4\pi\varepsilon_0}\left(\frac{1}{r} - \frac{1}{R}\right)$ (2.80)

Assume now that q is positive. We see that, independent of the amount
of charge Q that may have accumulated on the larger sphere and even if
it is positive, the inner sphere is always at a higher potential: the
difference V(r) − V(R) is positive. The potential due to Q is constant up to
radius R and so cancels out in the difference!


This means that if we now connect the smaller and larger sphere by a
wire, the charge q on the former will immediately flow onto the latter,
even though the charge Q may be quite large. The natural tendency is for
positive charge to move from higher to lower potential. Thus, provided we
are somehow able to introduce the small charged sphere into the larger
one, we can in this way keep piling up larger and larger amounts of charge
on the latter. The potential (Eq. 2.78) at the outer sphere would also keep
rising, at least until we reach the breakdown field of air.
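
The claim that the potential difference between the small sphere and the shell does not depend on Q can be checked numerically. The sketch below evaluates the potentials from Eqs. (2.78) and (2.79) for several shell charges; the values of q, r, R and Q are assumptions chosen only for illustration, and the function names are hypothetical.

```python
import math

EPSILON_0 = 8.854e-12  # permittivity of free space, C^2 N^-1 m^-2
K = 1.0 / (4.0 * math.pi * EPSILON_0)


def potential_at_small_sphere(q, Q, r, R):
    """V(r): the shell contributes Q/R (constant inside), the small sphere q/r."""
    return K * (Q / R + q / r)


def potential_at_shell(q, Q, R):
    """V(R): both charges act like point charges at the centre."""
    return K * (Q / R + q / R)


# Assumed illustrative values: q = 1 nC, r = 5 cm, R = 1 m
q, r, R = 1e-9, 0.05, 1.0
differences = []
for Q in (0.0, 1e-6, 1e-3):  # shell charge in coulomb
    dv = potential_at_small_sphere(q, Q, r, R) - potential_at_shell(q, Q, R)
    differences.append(dv)
    print(f"Q = {Q:g} C  ->  V(r) - V(R) = {dv:.2f} V")
```

The printed difference is the same (and positive) for every Q, so a positive charge q on the inner sphere always flows to the shell when the two are connected.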

Figure 2.32 Illustrating the principle of the electrostatic generator.

This is the principle of the Van de Graaff generator. It is a machine
capable of building up potential difference of a few million volts, and fields
close to the breakdown field of air which is about 3 × 10⁶ V/m. A
schematic diagram of the Van de Graaff generator is given in Fig. 2.33. A
large spherical conducting shell (of a few metres radius) is supported at a
height several metres above the ground on an insulating column. A long
narrow endless belt of insulating material, like rubber or silk, is wound
around two pulleys, one at ground level, one at the centre of the shell.
This belt is kept continuously moving by a motor driving the lower pulley.
It continuously carries positive charge, sprayed on to it by a brush at
ground level, to the top. There it transfers its positive charge to another
conducting brush connected to the large shell. Thus positive charge is
transferred to the shell, where it spreads out uniformly on the outer
surface. In this way, voltage differences of as much as 6 or 8 million volts
(with respect to ground) can be built up.
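
A rough estimate of how the breakdown field limits the shell potential can be sketched as follows. For an isolated charged sphere the surface field and potential are related by E = V/R, so the potential cannot exceed the breakdown field times the radius. The radii below are assumed values, the function name is hypothetical, and the model ignores the insulating column, the belt and any pressurised gas used in real machines.

```python
BREAKDOWN_FIELD_AIR = 3e6  # V/m, the value quoted in the text


def max_shell_potential(radius, breakdown_field=BREAKDOWN_FIELD_AIR):
    """Largest potential before the surface field reaches breakdown.

    For an isolated charged sphere, V = kQ/R and E = kQ/R^2 at the
    surface, so E = V/R and V_max = E_breakdown * R.
    """
    return breakdown_field * radius


for radius in (0.5, 1.0, 2.0):  # assumed radii in metres
    print(radius, max_shell_potential(radius))
```

In this idealised picture a larger shell is needed to hold a larger potential, which is why the generator uses a shell a few metres across.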



Figure 2.33 Principle of construction of Van de Graaff generator.

Summary

1. Electrostatic force is a conservative force. Work done by an
external force (equal and opposite to the electrostatic force) in
bringing a charge q from a point R to a point P is $q(V_P - V_R)$, which is
the difference in potential energy of charge q between the final and
initial points.

2. Potential at a point is the work done per unit charge (by an external
agency) in bringing a charge from infinity to that point. Potential at a
point is arbitrary to within an additive constant, since it is the potential
difference between two points which is physically significant. If
potential at infinity is chosen to be zero, the potential at a point with
position vector r due to a point charge Q placed at the origin is given by

$V(r) = \frac{1}{4\pi\varepsilon_0}\frac{Q}{r}$


3. The electrostatic potential at a point with position vector r due to a
point dipole of dipole moment p placed at the origin is

$V(\mathbf{r}) = \frac{1}{4\pi\varepsilon_0}\frac{\mathbf{p}\cdot\hat{\mathbf{r}}}{r^2}$

The result is true also for a dipole (with charges −q and q separated
by 2a) for r ≫ a.

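4. For a charge configuration q₁, q₂, ..., qₙ with position vectors r₁,
r₂, ..., rₙ, the potential at a point P is given by the superposition
principle

$V = \frac{1}{4\pi\varepsilon_0}\left(\frac{q_1}{r_{1P}} + \frac{q_2}{r_{2P}} + \dots + \frac{q_n}{r_{nP}}\right)$

where $r_{1P}$ is the distance between q₁ and P, and so on.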







5. An equipotential surface is a surface over which potential has a
constant value. For a point charge, concentric spheres centred at the
location of the charge are equipotential surfaces. The electric field E
at a point is perpendicular to the equipotential surface through the
point. E is in the direction of the steepest decrease of potential.

6. Potential energy stored in a system of charges is the work done
(by an external agency) in assembling the charges at their locations.
Potential energy of two charges q₁, q₂ at r₁, r₂ is given by

$U = \frac{1}{4\pi\varepsilon_0}\frac{q_1 q_2}{r_{12}}$

where $r_{12}$ is the distance between q₁ and q₂.

7. The potential energy of a charge q in an external potential V(r) is 
qV(r). 

The potential energy of a dipole moment p in a uniform electric field E
is −p·E.

8. Electrostatic field E is zero in the interior of a conductor; just
outside the surface of a charged conductor, E is normal to the surface
and is given by $\mathbf{E} = \frac{\sigma}{\varepsilon_0}\hat{\mathbf{n}}$, where $\hat{\mathbf{n}}$ is the unit vector along the outward
normal to the surface and σ is the surface charge density. Charges in
a conductor can reside only at its surface. Potential is constant within
and on the surface of a conductor. In a cavity within a conductor (with
no charges), the electric field is zero.

9. A capacitor is a system of two conductors separated by an
insulator. Its capacitance is defined by C = Q/V, where Q and −Q are
the charges on the two conductors and V is the potential difference
between them. C is determined purely geometrically, by the shapes,
sizes and relative positions of the two conductors. The unit of
capacitance is the farad: 1 F = 1 C V⁻¹. For a parallel plate capacitor
(with vacuum between the plates),

$C = \varepsilon_0 \frac{A}{d}$

where A is the area of each plate and d the separation between
them.

10. If the medium between the plates of a capacitor is filled with an 
insulating substance (dielectric), the electric field due to the charged 
plates induces a net dipole moment in the dielectric. This effect, 
called polarisation, gives rise to a field in the opposite direction. The 
net electric field inside the dielectric and hence the potential 
difference between the plates is thus reduced. Consequently, the 
capacitance C increases from its value C₀ when there is no medium
(vacuum),

$C = K C_0$

where K is the dielectric constant of the insulating substance.

11. For capacitors in the series combination, the total capacitance C
is given by

$\frac{1}{C} = \frac{1}{C_1} + \frac{1}{C_2} + \frac{1}{C_3} + \dots$

In the parallel combination, the total capacitance C is:

$C = C_1 + C_2 + C_3 + \dots$

where C₁, C₂, C₃, ... are individual capacitances.

12. The energy U stored in a capacitor of capacitance C, with charge
Q and voltage V is

$U = \frac{1}{2}QV = \frac{1}{2}CV^2 = \frac{1}{2}\frac{Q^2}{C}$

The electric energy density (energy per unit volume) in a region with
electric field is $\frac{1}{2}\varepsilon_0 E^2$.

13. A Van de Graaff generator consists of a large spherical
conducting shell (a few metres in diameter). By means of a moving
belt and suitable brushes, charge is continuously transferred to the
shell and a potential difference of the order of several million volts is
built up, which can be used for accelerating charged particles.
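
The combination rules in point 11 and the energy formulas in point 12 can be put into a small Python sketch. The function names are hypothetical and the capacitances and voltage are assumed values chosen so that the arithmetic is easy to follow.

```python
import math


def series_capacitance(*capacitances):
    """1/C = 1/C1 + 1/C2 + ... for capacitors in series."""
    return 1.0 / sum(1.0 / c for c in capacitances)


def parallel_capacitance(*capacitances):
    """C = C1 + C2 + ... for capacitors in parallel."""
    return sum(capacitances)


def stored_energy(capacitance, voltage):
    """U = (1/2) C V^2, equivalent to (1/2) Q V and Q^2 / (2C)."""
    return 0.5 * capacitance * voltage ** 2


caps = (2e-6, 3e-6, 6e-6)  # assumed capacitances in farad
c_series = series_capacitance(*caps)
c_parallel = parallel_capacitance(*caps)

voltage = 10.0                 # assumed voltage in volt
charge = c_parallel * voltage  # Q = C V
u_from_cv = stored_energy(c_parallel, voltage)
u_from_q = charge ** 2 / (2.0 * c_parallel)

print(c_series, c_parallel, u_from_cv, u_from_q)
```

The series value is smaller than the smallest individual capacitance, the parallel value is the plain sum, and the two energy expressions agree.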


Physical quantity     Symbol    Dimensions            Unit     Remark

Potential             φ or V    [M¹ L² T⁻³ A⁻¹]       V        Potential difference is physically significant
Capacitance           C         [M⁻¹ L⁻² T⁴ A²]       F
Polarisation          P         [L⁻² A T]             C m⁻²    Dipole moment per unit volume
Dielectric constant   K         [Dimensionless]


Points to Ponder 

1. Electrostatics deals with forces between charges at rest. But if 
there is a force on a charge, how can it be at rest? Thus, when we 
are talking of electrostatic force between charges, it should be 
understood that each charge is being kept at rest by some 
unspecified force that opposes the net Coulomb force on the charge. 

2. A capacitor is so configured that it confines the electric field lines
within a small region of space. Thus, even though the field may have
considerable strength, the potential difference between the two
conductors of a capacitor is small.

3. Electric field is discontinuous across the surface of a spherical
charged shell. It is zero inside and $\frac{\sigma}{\varepsilon_0}\hat{\mathbf{n}}$ outside. Electric potential is,
however, continuous across the surface, equal to $q/4\pi\varepsilon_0 R$ at the
surface.

4. The torque p × E on a dipole causes it to oscillate about E. Only if
there is a dissipative mechanism are the oscillations damped, so that the
dipole eventually aligns with E.

5. Potential due to a charge q at its own location is not defined - it is 
infinite. 

6. In the expression qV(r) for potential energy of a charge q, V(r) is 
the potential due to external charges and not the potential due to q. 
As seen in point 5, this expression will be ill-defined if V(r) includes 
potential due to a charge q itself. 

7. A cavity inside a conductor is shielded from outside electrical 
influences. It is worth noting that electrostatic shielding does not work 
the other way round; that is, if you put charges inside the cavity, the 
exterior of the conductor is not shielded from the fields by the inside 
charges. 


Exercises 

2.1 Two charges 5 × 10⁻⁸ C and −3 × 10⁻⁸ C are located 16 cm apart.
At what point(s) on the line joining the two charges is the electric
potential zero? Take the potential at infinity to be zero.

2.2 A regular hexagon of side 10 cm has a charge 5 pC at each of its 
vertices. Calculate the potential at the centre of the hexagon. 

2.3 Two charges 2 pC and -2 pC are placed at points A and B 6 cm 
apart. 

(a) Identify an equipotential surface of the system. 

(b) What is the direction of the electric field at every point on this 
surface? 

2.4 A spherical conductor of radius 12 cm has a charge of 1.6 × 10⁻⁷ C
distributed uniformly on its surface. What is the electric field

(a) inside the sphere 

(b) just outside the sphere 

(c) at a point 18 cm from the centre of the sphere? 

2.5 A parallel plate capacitor with air between the plates has a
capacitance of 8 pF (1 pF = 10⁻¹² F). What will be the capacitance if
the distance between the plates is reduced by half, and the space
between them is filled with a substance of dielectric constant 6?

2.6 Three capacitors each of capacitance 9 pF are connected in 
series. 

(a) What is the total capacitance of the combination? 

(b) What is the potential difference across each capacitor if the 
combination is connected to a 120 V supply? 

2.7 Three capacitors of capacitances 2 pF, 3 pF and 4 pF are 
connected in parallel. 




(a) What is the total capacitance of the combination? 

(b) Determine the charge on each capacitor if the combination is 
connected to a 100 V supply. 


2.8 In a parallel plate capacitor with air between the plates, each
plate has an area of 6 × 10⁻³ m² and the distance between the plates
is 3 mm. Calculate the capacitance of the capacitor. If this capacitor
is connected to a 100 V supply, what is the charge on each plate of
the capacitor?

2.9 Explain what would happen if in the capacitor given in Exercise 
2.8, a 3 mm thick mica sheet (of dielectric constant = 6) were inserted 
between the plates, 

(a) while the voltage supply remained connected. 

(b) after the supply was disconnected. 

2.10 A 12 pF capacitor is connected to a 50 V battery. How much
electrostatic energy is stored in the capacitor?

2.11 A 600 pF capacitor is charged by a 200 V supply. It is then
disconnected from the supply and is connected to another uncharged
600 pF capacitor. How much electrostatic energy is lost in the
process?




Additional Exercises 

2.12 A charge of 8 mC is located at the origin. Calculate the work
done in taking a small charge of −2 × 10⁻⁹ C from a point P (0, 0, 3
cm) to a point Q (0, 4 cm, 0), via a point R (0, 6 cm, 9 cm).

2.13 A cube of side b has a charge q at each of its vertices. 
Determine the potential and electric field due to this charge array at 
the centre of the cube. 

2.14 Two tiny spheres carrying charges 1.5 pC and 2.5 pC are 
located 30 cm apart. Find the potential and electric field: 

(a) at the mid-point of the line joining the two charges, and 

(b) at a point 10 cm from this midpoint in a plane normal to the line 
and passing through the mid-point. 

2.15 A spherical conducting shell of inner radius r₁ and outer radius r₂
has a charge Q.

(a) A charge q is placed at the centre of the shell. What is the surface 
charge density on the inner and outer surfaces of the shell? 

(b) Is the electric field inside a cavity (with no charge) zero, even if 
the shell is not spherical, but has any irregular shape? Explain. 

2.16 (a) Show that the normal component of electrostatic field has a
discontinuity from one side of a charged surface to another given by

$(\mathbf{E}_2 - \mathbf{E}_1)\cdot\hat{\mathbf{n}} = \frac{\sigma}{\varepsilon_0}$

where $\hat{\mathbf{n}}$ is a unit vector normal to the surface at a point and σ is the
surface charge density at that point. (The direction of $\hat{\mathbf{n}}$ is from side
1 to side 2.) Hence, show that just outside a conductor, the electric
field is $\sigma\hat{\mathbf{n}}/\varepsilon_0$.

(b) Show that the tangential component of electrostatic field is
continuous from one side of a charged surface to another. [Hint: For
(a), use Gauss’s law. For (b), use the fact that work done by
electrostatic field on a closed loop is zero.]

2.17 A long charged cylinder of linear charge density λ is
surrounded by a hollow co-axial conducting cylinder. What is the 
electric field in the space between the two cylinders? 

2.18 In a hydrogen atom, the electron and proton are bound at a
distance of about 0.53 Å:

(a) Estimate the potential energy of the system in eV, taking the zero
of the potential energy at infinite separation of the electron from
proton.

(b) What is the minimum work required to free the electron, given that
its kinetic energy in the orbit is half the magnitude of potential energy
obtained in (a)?

(c) What are the answers to (a) and (b) above if the zero of potential
energy is taken at 1.06 Å separation?

2.19 If one of the two electrons of a H₂ molecule is removed, we get a
hydrogen molecular ion H₂⁺. In the ground state of an H₂⁺, the two
protons are separated by roughly 1.5 Å, and the electron is roughly 1
Å from each proton. Determine the potential energy of the system.
Specify your choice of the zero of potential energy.

2.20 Two charged conducting spheres of radii a and b are connected 
to each other by a wire. What is the ratio of electric fields at the 
surfaces of the two spheres? Use the result obtained to explain why 




charge density on the sharp and pointed ends of a conductor is 
higher than on its flatter portions. 

2.21 Two charges -q and +q are located at points (0, 0, -a) and (0, 0, 
a), respectively. 

(a) What is the electrostatic potential at the points (0, 0, z) and 
(x, y, 0) ? 

(b) Obtain the dependence of potential on the distance r of a point
from the origin when r/a ≫ 1.

(c) How much work is done in moving a small test charge from the 
point (5,0,0) to (-7,0,0) along the x-axis? Does the answer change if 
the path of the test charge between the same points is not along the 
x-axis? 

2.22 Figure 2.34 shows a charge array known as an electric
quadrupole. For a point on the axis of the quadrupole, obtain the
dependence of potential on r for r/a ≫ 1, and contrast your results with
that due to an electric dipole, and an electric monopole (i.e., a single
charge).

[Figure: charges q, −q, −q, q on a line, spacings labelled a, point P on the axis at distance r]

Figure 2.34

2.23 An electrical technician requires a capacitance of 2 pF in a 
circuit across a potential difference of 1 kV. A large number of 1 pF 
capacitors are available to him each of which can withstand a 
potential difference of not more than 400 V. Suggest a possible 
arrangement that requires the minimum number of capacitors. 

2.24 What is the area of the plates of a 2 F parallel plate capacitor, 
given that the separation between the plates is 0.5 cm? [You will 






realise from your answer why ordinary capacitors are in the range of 
pF or less. However, electrolytic capacitors do have a much larger 
capacitance (0.1 F) because of very minute separation between the 
conductors.] 

2.25 Obtain the equivalent capacitance of the network in Fig. 2.35. 
For a 300 V supply, determine the charge and voltage across each 
capacitor. 

[Figure: capacitor network across a 300 V supply; legible labels: 100 pF, C4]

Figure 2.35

2.26 The plates of a parallel plate capacitor have an area of 90 cm²
each and are separated by 2.5 mm. The capacitor is charged by 
connecting it to a 400 V supply. 

(a) How much electrostatic energy is stored by the capacitor? 

(b) View this energy as stored in the electrostatic field between the 
plates, and obtain the energy per unit volume u. Hence arrive at a 
relation between u and the magnitude of electric field E between the 
plates. 


[Displaced labels from Figure 2.35: 200 pF, 200 pF, 100 pF]















2.27 A 4 µF capacitor is charged by a 200 V supply. It is then
disconnected from the supply, and is connected to another
uncharged 2 µF capacitor. How much electrostatic energy of the first
capacitor is lost in the form of heat and electromagnetic radiation?

2.28 Show that the force on each plate of a parallel plate capacitor
has a magnitude equal to (1/2) QE, where Q is the charge on the
capacitor, and E is the magnitude of electric field between the plates.
Explain the origin of the factor 1/2.

2.29 A spherical capacitor consists of two concentric spherical
conductors, held in position by suitable insulating supports (Fig.
2.36). Show that the capacitance of a spherical capacitor is given by

$C = \frac{4\pi\varepsilon_0 r_1 r_2}{r_1 - r_2}$

where r₁ and r₂ are the radii of outer and inner spheres,
respectively.

2.30 A spherical capacitor has an inner sphere of radius 12 cm and 
an outer sphere of radius 13 cm. The outer sphere is earthed and the 
inner sphere is given a charge of 2.5 pC. The space between the 
concentric spheres is filled with a liquid of dielectric constant 32. 

(a) Determine the capacitance of the capacitor. 

(b) What is the potential of the inner sphere? 

(c) Compare the capacitance of this capacitor with that of an isolated 
sphere of radius 12 cm. Explain why the latter is much smaller. 

2.31 Answer carefully: 

(a) Two large conducting spheres carrying charges Q₁ and Q₂ are
brought close to each other. Is the magnitude of electrostatic force
between them exactly given by Q₁Q₂/4πε₀r², where r is the distance
between their centres?

(b) If Coulomb’s law involved 1/r³ dependence (instead of 1/r²), would
Gauss’s law still be true?

(c) A small test charge is released at rest at a point in an electrostatic 
field configuration. Will it travel along the field line passing through 
that point? 

(d) What is the work done by the field of a nucleus in a complete 
circular orbit of the electron? What if the orbit is elliptical? 

(e) We know that electric field is discontinuous across the surface of 
a charged conductor. Is electric potential also discontinuous there? 

(f) What meaning would you give to the capacitance of a single 
conductor? 

(g) Guess a possible reason why water has a much greater dielectric 
constant (= 80) than say, mica (= 6). 












Figure 2.36 

2.32 A cylindrical capacitor has two co-axial cylinders of length 15 cm 
and radii 1.5 cm and 1.4 cm. The outer cylinder is earthed and the 
inner cylinder is given a charge of 3.5 pC. Determine the capacitance 
of the system and the potential of the inner cylinder. Neglect end 
effects (i.e., bending of field lines at the ends). 

2.33 A parallel plate capacitor is to be designed with a voltage rating
1 kV, using a material of dielectric constant 3 and dielectric strength
about 10⁷ V m⁻¹. (Dielectric strength is the maximum electric field a
material can tolerate without breakdown, i.e., without starting to 
conduct electricity through partial ionisation.) For safety, we should 
like the field never to exceed, say 10% of the dielectric strength. 
What minimum area of the plates is required to have a capacitance of 
50 pF? 

2.34 Describe schematically the equipotential surfaces corresponding 
to 

(a) a constant electric field in the z-direction, 

(b) a field that uniformly increases in magnitude but remains in a 
constant (say, z) direction, 

(c) a single positive charge at the origin, and 

(d) a uniform grid consisting of long equally spaced parallel charged 
wires in a plane. 

2.35 In a Van de Graaff type generator a spherical metal shell is to be
a 15 × 10⁶ V electrode. The dielectric strength of the gas surrounding
the electrode is 5 × 10⁷ V m⁻¹. What is the minimum radius of the
spherical shell required? (You will learn from this exercise why one
cannot build an electrostatic generator using a very small shell which
requires a small charge to acquire a high potential.)

2.36 A small sphere of radius r₁ and charge q₁ is enclosed by a
spherical shell of radius r₂ and charge q₂. Show that if q₁ is positive,
charge will necessarily flow from the sphere to the shell (when the
two are connected by a wire) no matter what the charge q₂ on the
shell is.

2.37 Answer the following: 

(a) The top of the atmosphere is at about 400 kV with respect to the
surface of the earth, corresponding to an electric field that decreases
with altitude. Near the surface of the earth, the field is about 100 V m⁻¹.
Why then do we not get an electric shock as we step out of our
house into the open? (Assume the house to be a steel cage so there
is no field inside!)

(b) A man fixes outside his house one evening a two metre high
insulating slab carrying on its top a large aluminium sheet of area
1 m². Will he get an electric shock if he touches the metal sheet next
morning?

(c) The discharging current in the atmosphere due to the small 
conductivity of air is known to be 1800 A on an average over the 
globe. Why then does the atmosphere not discharge itself completely 
in due course and become electrically neutral? In other words, what 
keeps the atmosphere charged? 

(d) What are the forms of energy into which the electrical energy of 
the atmosphere is dissipated during a lightning? 

(Hint: The earth has an electric field of about 100 V m⁻¹ at its surface
in the downward direction, corresponding to a surface charge density
= −10⁻⁹ C m⁻². Due to the slight conductivity of the atmosphere up to
about 50 km (beyond which it is a good conductor), about +1800 C is
pumped every second into the earth as a whole. The earth, however,
does not get discharged since thunderstorms and lightning occurring
continually all over the globe pump an equal amount of negative charge
on the earth.)
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Chapter Three 


Current Electricity 


3.1 Introduction 

In Chapter 1, all charges, whether free or bound, were considered to be at
rest. Charges in motion constitute an electric current. Such currents 
occur naturally in many situations. Lightning is one such phenomenon in 
which charges flow from the clouds to the earth through the atmosphere, 
sometimes with disastrous results. The flow of charges in lightning is not 
steady, but in our everyday life we see many devices where charges flow 
in a steady manner, like water flowing smoothly in a river. A torch and a 
cell-driven clock are examples of such devices. In the present chapter, 
we shall study some of the basic laws concerning steady electric 
currents. 

3.2 Electric Current 

Imagine a small area held normal to the direction of flow of charges. Both 
the positive and the negative charges may flow forward and backward 
across the area. In a given time interval t, let q₊ be the net amount (i.e.,
forward minus backward) of positive charge that flows in the forward
direction across the area. Similarly, let q₋ be the net amount of negative
charge flowing across the area in the forward direction. The net amount
of charge flowing across the area in the forward direction in the time
interval t, then, is q = q₊ − q₋. This is proportional to t for steady current
and the quotient

$I = \frac{q}{t}$ (3.1)

is defined to be the current across the area in the forward direction. (If it
turns out to be a negative number, it implies a current in the backward
direction.)

Currents are not always steady and hence more generally, we define the
current as follows. Let ΔQ be the net charge flowing across a cross-section
of a conductor during the time interval Δt [i.e., between times t
and (t + Δt)]. Then, the current at time t across the cross-section of the
conductor is defined as the value of the ratio of ΔQ to Δt in the limit of Δt
tending to zero,

$I(t) = \lim_{\Delta t \to 0} \frac{\Delta Q}{\Delta t}$ (3.2)

In SI units, the unit of current is ampere. An ampere is defined through 
magnetic effects of currents that we will study in the following chapter. An 
ampere is typically the order of magnitude of currents in domestic 
appliances. An average lightning carries currents of the order of tens of 
thousands of amperes and at the other extreme, currents in our nerves 
are in microamperes. 
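
The limiting process in Eq. (3.2) can be illustrated numerically. The sketch below assumes a hypothetical charge function Q(t) = Q₀(1 − e^(−t/τ)), which is not taken from the text, and shows the ratio ΔQ/Δt approaching the exact derivative as Δt shrinks. All names and values are assumptions for illustration.

```python
import math

Q0 = 2.0e-3  # assumed total charge, coulomb
TAU = 0.5    # assumed time constant, second


def charge_passed(t):
    """Hypothetical Q(t): charge that has crossed the section by time t."""
    return Q0 * (1.0 - math.exp(-t / TAU))


def average_current(t, dt):
    """Delta Q / Delta t over the interval [t, t + dt]."""
    return (charge_passed(t + dt) - charge_passed(t)) / dt


t = 0.2
exact = (Q0 / TAU) * math.exp(-t / TAU)  # analytic dQ/dt for this Q(t)
steps = (1e-1, 1e-2, 1e-3, 1e-4)
estimates = [average_current(t, dt) for dt in steps]

for dt, estimate in zip(steps, estimates):
    print(f"dt = {dt:g} s  ->  I = {estimate:.6e} A  (exact {exact:.6e} A)")
```

For a steady current the ratio would be the same for every Δt; for a varying current only the limit defines the instantaneous value.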

3.3 Electric Currents in Conductors 

An electric charge will experience a force if an electric field is applied. If it 
is free to move, it will thus move, contributing to a current. In nature, free
charged particles do exist, as in the upper strata of the atmosphere called the
ionosphere. However, in atoms and molecules, the negatively charged
electrons and the positively charged nuclei are bound to each other and
are thus not free to move. Bulk matter is made up of many molecules; a
gram of water, for example, contains approximately 10²² molecules.



These molecules are so closely packed that the electrons are no longer 
attached to individual nuclei. In some materials, the electrons will still be 
bound, i.e., they will not accelerate even if an electric field is applied. In 
other materials, notably metals, some of the electrons are practically free 
to move within the bulk material. These materials, generally called 
conductors, develop electric currents in them when an electric field is 
applied. 

If we consider solid conductors, then of course the atoms are tightly 
bound to each other so that the current is carried by the negatively 
charged electrons. There are, however, other types of conductors like 
electrolytic solutions where positive and negative charges both can 
move. In our discussions, we will focus only on solid conductors so that 
the current is carried by the negatively charged electrons in the 
background of fixed positive ions. 

Consider first the case when no electric field is present. The electrons will 
be moving due to thermal motion during which they collide with the fixed 
ions. An electron colliding with an ion emerges with the same speed as 
before the collision. However, the direction of its velocity after the 
collision is completely random. At a given time, there is no preferential 
direction for the velocities of the electrons. Thus on the average, the 
number of electrons travelling in any direction will be equal to the number 
of electrons travelling in the opposite direction. So, there will be no net 
electric current. 



Figure 3.1 Charges +Q and −Q put at the ends of a metallic cylinder. The electrons will drift
because of the electric field created to neutralise the charges. The current thus will stop after a
while unless the charges +Q and −Q are continuously replenished.

Let us now see what happens to such a piece of conductor if an electric 
field is applied. To focus our thoughts, imagine the conductor in the 
shape of a cylinder of radius R (Fig. 3.1). Suppose we now take two thin 
circular discs of a dielectric of the same radius and put positive charge 
+Q distributed over one disc and similarly -Q at the other disc. We attach 
the two discs on the two flat surfaces of the cylinder. An electric field will 
be created and is directed from the positive towards the negative charge. 
The electrons will be accelerated due to this field towards +Q. They will 
thus move to neutralise the charges. The electrons, as long as they are 
moving, will constitute an electric current. Hence in the situation 
considered, there will be a current for a very short while and no current 
thereafter. 

We can also imagine a mechanism where the ends of the cylinder are 
supplied with fresh charges to make up for any charges neutralised by 
electrons moving inside the conductor. In that case, there will be a steady 
electric field in the body of the conductor. This will result in a continuous 
current rather than a current for a short period of time. Mechanisms
which maintain a steady electric field are cells or batteries, which we shall
study later in this chapter. In the next sections, we shall study the steady
current that results from a steady electric field in conductors.

3.4 Ohm’s Law 

A basic law regarding flow of currents was discovered by G.S. Ohm in 
1828, long before the physical mechanism responsible for flow of 
currents was discovered. Imagine a conductor through which a current I 
is flowing and let V be the potential difference between the ends of the 
conductor. Then Ohm’s law states that 


Figure 3.2 Illustrating the relation R = ρl/A for a rectangular slab of length l and area of
cross-section A.

$V \propto I$

or, $V = R I$ (3.3)

where the constant of proportionality R is called the resistance of the
conductor. The SI unit of resistance is the ohm, denoted by the
symbol Ω. The resistance R not only depends on the material of the
conductor but also on the dimensions of the conductor. The dependence 
of R on the dimensions of the conductor can easily be determined as 
follows. 

Consider a conductor satisfying Eq. (3.3) to be in the form of a slab of
length l and cross sectional area A [Fig. 3.2(a)]. Imagine placing two such
identical slabs side by side [Fig. 3.2(b)], so that the length of the
combination is 2l. The current flowing through the combination is the
same as that flowing through either of the slabs. If V is the potential
difference across the ends of the first slab, then V is also the potential
difference across the ends of the second slab since the second slab is
identical to the first and the same current I flows through both. The
potential difference across the ends of the combination is clearly the sum of
the potential differences across the two individual slabs and hence equals
2V. The current through the combination is I and the resistance of the
combination $R_C$ is [from Eq. (3.3)],



$$R_c = \frac{2V}{I} = 2R \tag{3.4}$$

since V/I = R, the resistance of either of the slabs. Thus, doubling the
length of a conductor doubles the resistance. In general, then, resistance
is proportional to length,

$$R \propto l \tag{3.5}$$

Georg Simon Ohm (1787-1854) German physicist, professor at
Munich. Ohm was led to his law by an analogy with the
conduction of heat: the electric field is analogous to the temperature
gradient, and the electric current is analogous to the heat flow.


Next, imagine dividing the slab into two by cutting it lengthwise so that
the slab can be considered as a combination of two identical slabs of
length $l$, but each having a cross-sectional area of A/2 [Fig. 3.2(c)].

For a given voltage V across the slab, if I is the current through the entire
slab, then clearly the current flowing through each of the two half-slabs is
I/2. Since the potential difference across the ends of the half-slabs is V,
i.e., the same as across the full slab, the resistance of each of the
half-slabs $R_1$ is

$$R_1 = \frac{V}{(I/2)} = 2\frac{V}{I} = 2R \tag{3.6}$$

Thus, halving the area of the cross-section of a conductor doubles the
resistance. In general, then, the resistance R is inversely proportional to
the cross-sectional area,

$$R \propto \frac{1}{A} \tag{3.7}$$

Combining Eqs. (3.5) and (3.7), we have

$$R \propto \frac{l}{A} \tag{3.8}$$

and hence for a given conductor

$$R = \rho \frac{l}{A} \tag{3.9}$$

where the constant of proportionality $\rho$ depends on the material of the
conductor but not on its dimensions. $\rho$ is called resistivity.
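The scaling argument above can be checked numerically. The following is a minimal sketch, assuming a hypothetical copper wire (10 m long, 1.0 × 10⁻⁷ m² in cross-section) and the copper resistivity listed in Table 3.1; the function name `resistance` is introduced here only for illustration.

```python
def resistance(resistivity_ohm_m, length_m, area_m2):
    """Resistance of a uniform conductor, R = rho * l / A (Eq. 3.9)."""
    if area_m2 <= 0:
        raise ValueError("cross-sectional area must be positive")
    return resistivity_ohm_m * length_m / area_m2


RHO_COPPER = 1.7e-8  # ohm m, value quoted in Table 3.1

# Hypothetical wire dimensions, chosen only for illustration.
r_single = resistance(RHO_COPPER, 10.0, 1.0e-7)
r_double_length = resistance(RHO_COPPER, 20.0, 1.0e-7)  # length doubled
r_half_area = resistance(RHO_COPPER, 10.0, 0.5e-7)      # area halved

print(r_single, r_double_length, r_half_area)
```

Doubling the length and halving the area each double the resistance, as Eqs. (3.4) and (3.6) require.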

Using the last equation, Ohm’s law reads

$$V = I \times R = \frac{I \rho l}{A} \tag{3.10}$$

Current per unit area (taken normal to the current), I/A, is called current
density and is denoted by j. The SI unit of the current density is A/m².
Further, if E is the magnitude of the uniform electric field in the conductor
whose length is $l$, then the potential difference V across its ends is $El$.
Using these, the last equation reads

$$E l = j \rho l$$

or,

$$E = j \rho \tag{3.11}$$

The above relation for magnitudes E and j can indeed be cast in a
vector form. The current density (which we have defined as the current
through unit area normal to the current) is also directed along E, and is
also a vector $\mathbf{j}$ ($\equiv j\,\mathbf{E}/E$). Thus, the last equation can be written as,

$$\mathbf{E} = \mathbf{j} \rho \tag{3.12}$$

or,

$$\mathbf{j} = \sigma \mathbf{E} \tag{3.13}$$

where $\sigma \equiv 1/\rho$ is called the conductivity. Ohm’s law is often stated in an
equivalent form, Eq. (3.13), in addition to Eq. (3.3). In the next section, we
will try to understand the origin of Ohm’s law as arising from the
characteristics of the drift of electrons.

3.5 Drift of Electrons and the Origin of Resistivity 

As remarked before, an electron will suffer collisions with the heavy fixed
ions, but after each collision, it will emerge with the same speed but in a random
direction. If we consider all the electrons, their average velocity will be
zero since their directions are random. Thus, if there are N electrons and
the velocity of the i-th electron (i = 1, 2, 3, ... N) at a given time is $\mathbf{v}_i$, then

$$\frac{1}{N} \sum_{i=1}^{N} \mathbf{v}_i = 0 \tag{3.14}$$



Figure 3.3 A schematic picture of an electron moving from a point A to another point B through
repeated collisions, and straight line travel between collisions (full lines). If an electric field is
applied as shown, the electron ends up at point B' (dotted lines). A slight drift in a direction
opposite the electric field is visible.

Consider now the situation when an electric field is present. Electrons will
be accelerated due to this field by

$$\mathbf{a} = \frac{-e\mathbf{E}}{m} \tag{3.15}$$

where -e is the charge and m is the mass of an electron. Consider again
the i-th electron at a given time t. This electron would have had its last
collision some time before t, and let $t_i$ be the time elapsed after its last
collision. If $\mathbf{v}_i$ was its velocity immediately after the last collision, then its
velocity $\mathbf{V}_i$ at time t is

$$\mathbf{V}_i = \mathbf{v}_i + \frac{-e\mathbf{E}}{m}\, t_i \tag{3.16}$$


since starting with its last collision it was accelerated (Fig. 3.3) with an
acceleration given by Eq. (3.15) for a time interval $t_i$. The average
velocity of the electrons at time t is the average of all the $\mathbf{V}_i$’s. The
average of the $\mathbf{v}_i$’s is zero [Eq. (3.14)] since immediately after any collision,
the direction of the velocity of an electron is completely random. The
collisions of the electrons do not occur at regular intervals but at random
times. Let us denote by $\tau$ the average time between successive
collisions. Then at a given time, some of the electrons would have spent
time more than $\tau$ and some less than $\tau$. In other words, the time $t_i$ in
Eq. (3.16) will be less than $\tau$ for some and more than $\tau$ for others as we
go through the values of i = 1, 2, ..., N. The average value of $t_i$ then is $\tau$
(known as relaxation time). Thus, averaging Eq. (3.16) over the
N electrons at any given time t gives us for the average velocity $\mathbf{v}_d$


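$$\mathbf{v}_d \equiv (\mathbf{V}_i)_{\text{average}} = (\mathbf{v}_i)_{\text{average}} - \frac{e\mathbf{E}}{m}\,(t_i)_{\text{average}} = 0 - \frac{e\mathbf{E}}{m}\,\tau = -\frac{e\mathbf{E}}{m}\,\tau \tag{3.17}$$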


This last result is surprising. It tells us that the electrons move with an 
average velocity which is independent of time, although electrons are 
accelerated. This is the phenomenon of drift and the velocity $\mathbf{v}_d$ in Eq.
(3.17) is called the drift velocity.

Figure 3.4 Current in a metallic conductor. The magnitude of current density in a metal is the
magnitude of charge contained in a cylinder of unit area and length $v_d$.


Because of the drift, there will be net transport of charges across any
area perpendicular to E. Consider a planar area A, located inside the
conductor such that the normal to the area is parallel to E (Fig. 3.4). Then
because of the drift, in an infinitesimal amount of time $\Delta t$, all electrons to
the left of the area at distances up to $|\mathbf{v}_d| \Delta t$ would have crossed the area.
If n is the number of free electrons per unit volume in the metal, then
there are $n\, \Delta t\, |\mathbf{v}_d| A$ such electrons. Since each electron carries a charge
-e, the total charge transported across this area A to the right in time $\Delta t$
is $-n e A |\mathbf{v}_d| \Delta t$. E is directed towards the left and hence the total charge
transported along E across the area is negative of this. The amount of
charge crossing the area A in time $\Delta t$ is by definition [Eq. (3.2)] $I \Delta t$,
where I is the magnitude of the current. Hence,

$$I \Delta t = +n e A |\mathbf{v}_d| \Delta t \tag{3.18}$$


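Substituting the value of $|\mathbf{v}_d|$ from Eq. (3.17)

$$I \Delta t = \frac{e^2 A}{m}\, \tau\, n\, \Delta t\, |\mathbf{E}| \tag{3.19}$$

By definition I is related to the magnitude $|\mathbf{j}|$ of the current density by

$$I = |\mathbf{j}| A \tag{3.20}$$

Hence, from Eqs. (3.19) and (3.20),

$$|\mathbf{j}| = \frac{n e^2}{m}\, \tau\, |\mathbf{E}| \tag{3.21}$$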

The vector $\mathbf{j}$ is parallel to $\mathbf{E}$ and hence we can write Eq. (3.21) in the
vector form

$$\mathbf{j} = \frac{n e^2}{m}\, \tau\, \mathbf{E} \tag{3.22}$$

Comparison with Eq. (3.13) shows that Eq. (3.22) is exactly Ohm’s
law, if we identify the conductivity $\sigma$ as

$$\sigma = \frac{n e^2}{m}\, \tau \tag{3.23}$$

We thus see that a very simple picture of electrical conduction
reproduces Ohm’s law. We have, of course, made assumptions that $\tau$
and n are constants, independent of E. We shall, in the next section,
discuss the limitations of Ohm’s law.
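Equation (3.23) can be inverted to estimate the relaxation time from a measured resistivity, since $\sigma = 1/\rho$. The sketch below is only an order-of-magnitude estimate: it assumes the standard electron mass (a value not quoted in this chapter), the copper resistivity of Table 3.1 and the copper electron density worked out in Example 3.1. The function name `relaxation_time` is hypothetical.

```python
ELECTRON_MASS = 9.11e-31    # kg, assumed standard value (not given in the text)
ELECTRON_CHARGE = 1.6e-19   # C, magnitude of the electron charge


def relaxation_time(resistivity_ohm_m, number_density_per_m3):
    """Invert sigma = n e^2 tau / m with sigma = 1/rho to get tau = m / (n e^2 rho)."""
    return ELECTRON_MASS / (
        number_density_per_m3 * ELECTRON_CHARGE**2 * resistivity_ohm_m
    )


# Copper: rho from Table 3.1, n from Example 3.1.
tau_copper = relaxation_time(1.7e-8, 8.5e28)
print(f"tau for copper is about {tau_copper:.1e} s")
```

The result is of the order of 10⁻¹⁴ s, which gives a feel for how frequently a conduction electron is scattered in this simple picture.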


Example 3.1 (a) Estimate the average drift speed of conduction
electrons in a copper wire of cross-sectional area 1.0 × 10⁻⁷ m²
carrying a current of 1.5 A. Assume that each copper atom
contributes roughly one conduction electron. The density of copper is
9.0 × 10³ kg/m³, and its atomic mass is 63.5 u. (b) Compare the drift
speed obtained above with (i) thermal speeds of copper atoms at
ordinary temperatures, (ii) speed of propagation of electric field along
the conductor which causes the drift motion.

Solution 

(a) The direction of drift velocity of conduction electrons is opposite to
the electric field direction, i.e., electrons drift in the direction of
increasing potential. The drift speed $v_d$ is given by Eq. (3.18)

$$v_d = \frac{I}{n e A}$$

Now, e = 1.6 × 10⁻¹⁹ C, A = 1.0 × 10⁻⁷ m², I = 1.5 A. The density of
conduction electrons, n, is equal to the number of atoms per cubic
metre (assuming one conduction electron per Cu atom as is
reasonable from its valence electron count of one). A cubic metre of
copper has a mass of 9.0 × 10³ kg. Since 6.0 × 10²³ copper atoms
have a mass of 63.5 g,

$$n = \frac{6.0 \times 10^{23}}{63.5} \times 9.0 \times 10^{6} = 8.5 \times 10^{28}\ \text{m}^{-3}$$

which gives,

$$v_d = \frac{1.5}{8.5 \times 10^{28} \times 1.6 \times 10^{-19} \times 1.0 \times 10^{-7}} = 1.1 \times 10^{-3}\ \text{m s}^{-1} = 1.1\ \text{mm s}^{-1}$$

(b) (i) At a temperature T, the thermal speed* of a copper atom of
mass M is obtained from [$\langle \tfrac{1}{2} M v^2 \rangle = \tfrac{3}{2} k_B T$] and is thus
typically of the order of $\sqrt{k_B T/M}$, where $k_B$ is the Boltzmann
constant. For copper at 300 K, this is about 2 × 10² m/s. This figure
indicates the random vibrational speeds of copper atoms in a
conductor. Note that the drift speed of electrons is much smaller,
about 10⁻⁵ times the typical thermal speed at ordinary temperatures.

(ii) An electric field travelling along the conductor has the speed of an
electromagnetic wave, namely 3.0 × 10⁸ m s⁻¹
(you will learn about this in Chapter 8). The drift speed is, in
comparison, extremely small; smaller by a factor of 10⁻¹¹.

* See Eq. (13.23) of Chapter 13 from Class XI book.
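The arithmetic of Example 3.1 can be reproduced in a few lines. This is a sketch under the example's own assumptions (one conduction electron per atom, Avogadro's number rounded to 6.0 × 10²³). The Boltzmann constant and the atomic mass unit are standard values assumed here, and the function names are hypothetical.

```python
import math

AVOGADRO = 6.0e23            # atoms per mole, rounded as in the example
ELECTRON_CHARGE = 1.6e-19    # C
BOLTZMANN = 1.38e-23         # J/K, assumed standard value
ATOMIC_MASS_UNIT = 1.66e-27  # kg, assumed standard value


def number_density(density_kg_m3, molar_mass_g):
    """Atoms per cubic metre; equals n if each atom gives one conduction electron."""
    grams_per_m3 = density_kg_m3 * 1000.0
    return AVOGADRO * grams_per_m3 / molar_mass_g


def drift_speed(current_a, n_per_m3, area_m2):
    """v_d = I / (n e A), from Eq. (3.18)."""
    return current_a / (n_per_m3 * ELECTRON_CHARGE * area_m2)


def thermal_speed(temperature_k, mass_u):
    """Order-of-magnitude thermal speed sqrt(k_B T / M)."""
    return math.sqrt(BOLTZMANN * temperature_k / (mass_u * ATOMIC_MASS_UNIT))


n_cu = number_density(9.0e3, 63.5)
v_drift = drift_speed(1.5, n_cu, 1.0e-7)
v_thermal = thermal_speed(300.0, 63.5)
print(f"n = {n_cu:.2e} m^-3, v_d = {v_drift:.2e} m/s, v_th = {v_thermal:.0f} m/s")
```

The ratio `v_drift / v_thermal` comes out at a few times 10⁻⁶, consistent with the rough factor of 10⁻⁵ quoted in part (b)(i).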


Example 3.2 

(a) In Example 3.1, the electron drift speed is estimated to be only a
few mm s⁻¹ for currents in the range of a few amperes. How then is
current established almost the instant a circuit is closed?

(b) The electron drift arises due to the force experienced by electrons 
in the electric field inside the conductor. But force should cause 
acceleration. Why then do the electrons acquire a steady average 
drift speed? 

(c) If the electron drift speed is so small, and the electron’s charge is 
small, how can we still obtain large amounts of current in a 
conductor? 

(d) When electrons drift in a metal from lower to higher potential, 
does it mean that all the ‘free’ electrons of the metal are moving in the 
same direction? 

(e) Are the paths of electrons straight lines between successive 
collisions (with the positive ions of the metal) in the (i) absence of 
electric field, (ii) presence of electric field? 

Solution 

(a) Electric field is established throughout the circuit, almost instantly
(with the speed of light), causing at every point a local electron drift.
Establishment of a current does not have to wait for electrons from
one end of the conductor travelling to the other end. However, it does
take a little while for the current to reach its steady value.

(b) Each ‘free’ electron does accelerate, increasing its drift speed until 
it collides with a positive ion of the metal. It loses its drift speed after 
collision but starts to accelerate and increases its drift speed again 
only to suffer a collision again and so on. On the average, therefore, 
electrons acquire only a drift speed. 

(c) Simple, because the electron number density is enormous,
~10²⁹ m⁻³.

(d) By no means. The drift velocity is superposed over the large 
random velocities of electrons. 

(e) In the absence of electric field, the paths are straight lines; in the 
presence of electric field, the paths are, in general, curved. 


3.5.1 Mobility 

As we have seen, conductivity arises from mobile charge carriers. In 
metals, these mobile charge carriers are electrons; in an ionised gas, 
they are electrons and positive charged ions; in an electrolyte, these can 
be both positive and negative ions. 

An important quantity is the mobility $\mu$, defined as the magnitude of the
drift velocity per unit electric field:

$$\mu = \frac{|\mathbf{v}_d|}{E} \tag{3.24}$$

The SI unit of mobility is m²/Vs, which equals 10⁴ times the practical unit
(cm²/Vs). Mobility is positive. From Eq. (3.17), we have

$$v_d = \frac{e \tau E}{m}$$

Hence,

$$\mu = \frac{v_d}{E} = \frac{e \tau}{m} \tag{3.25}$$

where $\tau$ is the average collision time for electrons.
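Combining Eq. (3.25) with Eq. (3.23) gives $\sigma = n e \mu$, so the mobility can also be estimated from the resistivity as $\mu = 1/(\rho n e)$. The sketch below evaluates this for copper, using the resistivity from Table 3.1 and the electron density from Example 3.1; the function names are hypothetical and the result is only indicative.

```python
ELECTRON_CHARGE = 1.6e-19  # C


def mobility_from_resistivity(resistivity_ohm_m, n_per_m3):
    """mu = 1 / (rho n e), obtained by combining Eqs. (3.23) and (3.25)."""
    return 1.0 / (resistivity_ohm_m * n_per_m3 * ELECTRON_CHARGE)


def si_to_practical(mobility_m2_per_vs):
    """Convert m^2/(V s) to cm^2/(V s): 1 m^2 = 10^4 cm^2."""
    return mobility_m2_per_vs * 1.0e4


mu_copper = mobility_from_resistivity(1.7e-8, 8.5e28)
print(f"mu = {mu_copper:.2e} m^2/(V s) = {si_to_practical(mu_copper):.0f} cm^2/(V s)")
```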


3.6 Limitations of Ohm’s Law 

Although Ohm’s law has been found valid over a large class of materials, 
there do exist materials and devices used in electric circuits where the 
proportionality of V and I does not hold. The deviations broadly are one 
or more of the following types: 



Figure 3.5 The dashed line represents the linear Ohm’s law. The solid line is the voltage V versus
current I for a good conductor.

(a) V ceases to be proportional to I (Fig. 3.5).

(b) The relation between V and I depends on the sign of V. In other
words, if I is the current for a certain V, then reversing the direction of V
keeping its magnitude fixed, does not produce a current of the same
magnitude as I in the opposite direction (Fig. 3.6). This happens, for
example, in a diode which we will study in Chapter 14.

Figure 3.6 Characteristic curve of a diode. Note the different scales for negative and positive
values of the voltage and current.

Figure 3.7 Variation of current versus voltage for GaAs.

(c) The relation between V and I is not unique, i.e., there is more than
one value of V for the same current I (Fig. 3.7). A material exhibiting such
behaviour is GaAs.


Materials and devices not obeying Ohm’s law in the form of Eq. (3.3) are 
actually widely used in electronic circuits. In this and a few subsequent 
chapters, however, we will study the electrical currents in materials that 
obey Ohm’s law. 

3.7 Resistivity of Various Materials 

The resistivities of various common materials are listed in Table 3.1. The 
materials are classified as conductors, semiconductors and insulators 
depending on their resistivities, in an increasing order of their values. 

Metals have low resistivities in the range of 10⁻⁸ Ω m to 10⁻⁶ Ω m. At the
other end are insulators like ceramic, rubber and plastics having
resistivities 10¹⁸ times greater than metals or more. In between the two
are the semiconductors. These, however, have resistivities 
characteristically decreasing with a rise in temperature. The resistivities 
of semiconductors are also affected by presence of small amount of 
impurities. This last feature is exploited in use of semiconductors for 
electronic devices. 


Table 3.1 Resistivities of some materials

Material                       Resistivity, ρ (Ω m)     Temperature coefficient of resistivity,
                               at 0 °C                  α (°C)⁻¹ = (1/ρ)(dρ/dT) at 0 °C

Conductors
Silver                         1.6 × 10⁻⁸               0.0041
Copper                         1.7 × 10⁻⁸               0.0068
Aluminium                      2.7 × 10⁻⁸               0.0043
Tungsten                       5.6 × 10⁻⁸               0.0045
Iron                           10 × 10⁻⁸                0.0065
Platinum                       11 × 10⁻⁸                0.0039
Mercury                        98 × 10⁻⁸                0.0009
Nichrome                       ~100 × 10⁻⁸              0.0004
(alloy of Ni, Fe, Cr)
Manganin (alloy)               48 × 10⁻⁸                0.002 × 10⁻³

Semiconductors
Carbon (graphite)              3.5 × 10⁻⁵               -0.0005
Germanium                      0.46                     -0.05
Silicon                        2300                     -0.07

Insulators
Pure Water                     2.5 × 10⁵
Glass                          10¹⁰ - 10¹⁴
Hard Rubber                    10¹³ - 10¹⁶
NaCl                           ~10¹⁴
Fused Quartz                   ~10¹⁶



Commercially produced resistors for domestic use or in laboratories are 
of two major types: wire bound resistors and carbon resistors. Wire 
bound resistors are made by winding the wires of an alloy, viz., 
manganin, constantan, nichrome or similar ones. The choice of these 
materials is dictated mostly by the fact that their resistivities are relatively 
insensitive to temperature. These resistances are typically in the range of 
a fraction of an ohm to a few hundred ohms. 

Resistors in the higher range are made mostly from carbon. Carbon 
resistors are compact, inexpensive and thus find extensive use in 
electronic circuits. Carbon resistors are small in size and hence their 
values are given using a colour code. 












Table 3.2 Resistor colour codes

Colour        Number      Multiplier      Tolerance (%)

Black         0           1
Brown         1           10¹
Red           2           10²
Orange        3           10³
Yellow        4           10⁴
Green         5           10⁵
Blue          6           10⁶
Violet        7           10⁷
Gray          8           10⁸
White         9           10⁹
Gold                      10⁻¹            5
Silver                    10⁻²            10
No colour                                 20


The resistors have a set of co-axial coloured rings on them whose
significance is listed in Table 3.2. The first two bands from the end
indicate the first two significant figures of the resistance in ohms. The
third band indicates the decimal multiplier (as listed in Table 3.2). The last
band stands for tolerance or possible variation in percentage about the
indicated values. Sometimes, this last band is absent and that indicates a
tolerance of 20% (Fig. 3.8). For example, if the four colours are orange,
blue, yellow and gold, the resistance value is 36 × 10⁴ Ω, with a tolerance
value of 5%.
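The decoding rule just described is mechanical enough to write down as a short routine. The following is an illustrative sketch that encodes Table 3.2 as dictionaries; the names `decode_resistor`, `DIGITS` and so on are made up for this example.

```python
# Digit values and multiplier exponents taken from Table 3.2.
DIGITS = {
    "black": 0, "brown": 1, "red": 2, "orange": 3, "yellow": 4,
    "green": 5, "blue": 6, "violet": 7, "gray": 8, "white": 9,
}
MULTIPLIER_EXPONENTS = {**DIGITS, "gold": -1, "silver": -2}
TOLERANCE_PERCENT = {"gold": 5, "silver": 10}


def decode_resistor(bands):
    """Return (resistance in ohms, tolerance in percent) for 3 or 4 colour bands."""
    bands = [band.lower() for band in bands]
    if len(bands) not in (3, 4):
        raise ValueError("expected three or four colour bands")
    first, second, multiplier = bands[:3]
    significant = 10 * DIGITS[first] + DIGITS[second]
    resistance = significant * 10.0 ** MULTIPLIER_EXPONENTS[multiplier]
    # A missing fourth band means a tolerance of 20%.
    tolerance = TOLERANCE_PERCENT[bands[3]] if len(bands) == 4 else 20
    return resistance, tolerance


print(decode_resistor(["orange", "blue", "yellow", "gold"]))
print(decode_resistor(["red", "red", "red"]))
```

The first call reproduces the worked example in the text (36 × 10⁴ Ω with 5% tolerance); the second shows the 20% default when the tolerance band is absent.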









Figure 3.8 Colour coded resistors (a) (22 × 10² Ω) ± 10%, (b) (47 × 10 Ω) ± 5%.


3.8 Temperature Dependence of Resistivity 

The resistivity of a material is found to be dependent on the temperature.
Different materials do not exhibit the same dependence on temperature.
Over a limited range of temperatures, that is not too large, the resistivity
of a metallic conductor is approximately given by,

$$\rho_T = \rho_0 [1 + \alpha (T - T_0)] \tag{3.26}$$

where $\rho_T$ is the resistivity at a temperature T and $\rho_0$ is the same at a
reference temperature $T_0$. $\alpha$ is called the temperature coefficient of
resistivity, and from Eq. (3.26), the dimension of $\alpha$ is (Temperature)⁻¹. For
metals, $\alpha$ is positive and values of $\alpha$ for some metals at $T_0$ = 0 °C are
listed in Table 3.1.

The relation of Eq. (3.26) implies that a graph of $\rho_T$ plotted against T
would be a straight line. At temperatures much lower than 0 °C, the graph,
however, deviates considerably from a straight line (Fig. 3.9).

Equation (3.26) thus can be used approximately over a limited range of
T around any reference temperature $T_0$, where the graph can be
approximated as a straight line.



Figure 3.9 Resistivity $\rho_T$ of copper as a function of temperature T.

Figure 3.10 Resistivity $\rho_T$ of nichrome as a function of absolute temperature T.

Figure 3.11 Temperature dependence of resistivity for a typical semiconductor.


Some materials like Nichrome (which is an alloy of nickel, iron and
chromium) exhibit a very weak dependence of resistivity on temperature
(Fig. 3.10). Manganin and constantan have similar properties. These
materials are thus widely used in wire bound standard resistors since
their resistance values would change very little with temperature.

Unlike metals, the resistivities of semiconductors decrease with
increasing temperatures. A typical dependence is shown in Fig. 3.11.


We can qualitatively understand the temperature dependence of
resistivity, in the light of our derivation of Eq. (3.23). From this equation,
resistivity of a material is given by

$$\rho = \frac{1}{\sigma} = \frac{m}{n e^2 \tau} \tag{3.27}$$

$\rho$ thus depends inversely both on the number n of free electrons per unit
volume and on the average time $\tau$ between collisions. As we increase
temperature, average speed of the electrons, which act as the carriers of
current, increases resulting in more frequent collisions. The average time
between collisions $\tau$ thus decreases with temperature.

In a metal, n is not dependent on temperature to any appreciable extent
and thus the decrease in the value of $\tau$ with rise in temperature causes $\rho$
to increase as we have observed.

For insulators and semiconductors, however, n increases with
temperature. This increase more than compensates any decrease in $\tau$ in
Eq. (3.23) so that for such materials, $\rho$ decreases with temperature.


Example 3.3 An electric toaster uses nichrome for its heating
element. When a negligibly small current passes through it, its
resistance at room temperature (27.0 °C) is found to be 75.3 Ω.
When the toaster is connected to a 230 V supply, the current settles,
after a few seconds, to a steady value of 2.68 A. What is the steady
temperature of the nichrome element? The temperature coefficient of
resistance of nichrome averaged over the temperature range
involved is 1.70 × 10⁻⁴ °C⁻¹.

Solution When the current through the element is very small, heating
effects can be ignored and the temperature $T_1$ of the element is the
same as room temperature. When the toaster is connected to the
supply, its initial current will be slightly higher than its steady value of
2.68 A. But due to heating effect of the current, the temperature will
rise. This will cause an increase in resistance and a slight decrease
in current. In a few seconds, a steady state will be reached when
temperature will rise no further, and both the resistance of the
element and the current drawn will achieve steady values. The
resistance $R_2$ at the steady temperature $T_2$ is

$$R_2 = \frac{230\ \text{V}}{2.68\ \text{A}} = 85.8\ \Omega$$

Using the relation

$$R_2 = R_1 [1 + \alpha (T_2 - T_1)]$$

with $\alpha$ = 1.70 × 10⁻⁴ °C⁻¹, we get

$$T_2 - T_1 = \frac{(85.8 - 75.3)}{(75.3) \times 1.70 \times 10^{-4}} = 820\ ^\circ\text{C}$$

that is, $T_2$ = (820 + 27.0) °C = 847 °C

Thus, the steady temperature of the heating element (when heating
effect due to the current equals heat loss to the surroundings) is 847
°C.
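The same calculation can be written as a small function, which makes it easy to see how sensitive the answer is to rounding. This is a sketch only; `steady_temperature` is a name chosen for illustration, and it assumes the linear relation $R_2 = R_1[1 + \alpha(T_2 - T_1)]$ holds over the whole range.

```python
def steady_temperature(r_cold_ohm, t_cold_c, voltage_v, current_a, alpha_per_c):
    """Solve R2 = R1 [1 + alpha (T2 - T1)] for T2, with R2 = V / I."""
    r_hot_ohm = voltage_v / current_a
    return t_cold_c + (r_hot_ohm - r_cold_ohm) / (r_cold_ohm * alpha_per_c)


t_hot = steady_temperature(75.3, 27.0, 230.0, 2.68, 1.70e-4)
print(f"steady temperature is about {t_hot:.0f} degrees C")
```

Without rounding $R_2$ to 85.8 Ω the result is about 849 °C rather than 847 °C. Because $\alpha$ is so small for nichrome, a change of 0.02 Ω in $R_2$ shifts the answer by roughly 2 °C.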


Example 3.4 The resistance of the platinum wire of a platinum
resistance thermometer at the ice point is 5 Ω and at steam point is
5.23 Ω. When the thermometer is inserted in a hot bath, the
resistance of the platinum wire is 5.795 Ω. Calculate the temperature
of the bath.

Solution $R_0$ = 5 Ω, $R_{100}$ = 5.23 Ω and $R_t$ = 5.795 Ω

Now, since $R_t = R_0 (1 + \alpha t)$,

$$t = \frac{R_t - R_0}{R_{100} - R_0} \times 100 = \frac{5.795 - 5}{5.23 - 5} \times 100 = \frac{0.795}{0.23} \times 100 = 345.65\ ^\circ\text{C}$$
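The interpolation used in Example 3.4 can be sketched as a one-line function. It assumes, as the example does, that the resistance of the platinum wire varies linearly with temperature between the ice point and the bath temperature; the function name is hypothetical.

```python
def bath_temperature(r_ice_ohm, r_steam_ohm, r_bath_ohm):
    """Linear interpolation t = (R_t - R_0) / (R_100 - R_0) * 100, in degrees C."""
    return (r_bath_ohm - r_ice_ohm) / (r_steam_ohm - r_ice_ohm) * 100.0


t_bath = bath_temperature(5.0, 5.23, 5.795)
print(f"bath temperature is about {t_bath:.2f} degrees C")
```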


3.9 Electrical Energy, Power 

Consider a conductor with end points A and B, in which a current I is 
flowing from A to B. The electric potential at A and B are denoted by V(A) 
and V(B) respectively. Since current is flowing from A to B, V(A) > V(B) 
and the potential difference across AB is V = V(A) - V(B) > 0. 

In a time interval $\Delta t$, an amount of charge $\Delta Q = I \Delta t$ travels from A to B.
The potential energy of the charge at A, by definition, was $\Delta Q\, V(A)$ and
similarly at B, it is $\Delta Q\, V(B)$. Thus, the change in its potential energy $\Delta U_{pot}$ is

$\Delta U_{pot}$ = Final potential energy - Initial potential energy

$$\Delta U_{pot} = \Delta Q [V(B) - V(A)] = -\Delta Q\, V = -I V \Delta t < 0 \tag{3.28}$$

If charges moved without collisions through the conductor, their kinetic
energy would also change so that the total energy is unchanged.
Conservation of total energy would then imply that,

$$\Delta K = -\Delta U_{pot} \tag{3.29}$$

that is,

$$\Delta K = I V \Delta t > 0 \tag{3.30}$$

Thus, in case charges were moving freely through the conductor under
the action of electric field, their kinetic energy would increase as they
move. We have, however, seen earlier that on the average, charge
carriers do not move with acceleration but with a steady drift velocity.
This is because of the collisions with ions and atoms during transit.
During collisions, the energy gained by the charges is shared with
the atoms. The atoms vibrate more vigorously, i.e., the conductor heats
up. Thus, in an actual conductor, the amount of energy dissipated as heat
in the conductor during the time interval $\Delta t$ is,

$$\Delta W = I V \Delta t \tag{3.31}$$

The energy dissipated per unit time is the power dissipated
$P = \Delta W/\Delta t$ and we have,

$$P = I V \tag{3.32}$$

Using Ohm’s law V = IR, we get

$$P = I^2 R = V^2/R \tag{3.33}$$

as the power loss (“ohmic loss”) in a conductor of resistance R carrying a 
current I. It is this power which heats up, for example, the coil of an 
electric bulb to incandescence, radiating out heat and light. 


Where does the power come from? As we have reasoned before, we
need an external source to keep a steady current through the conductor.
It is clearly this source which must supply this power. In the simple circuit
shown with a cell (Fig. 3.12), it is the chemical energy of the cell which
supplies this power for as long as it can.

Figure 3.12 Heat is produced in the resistor R which is connected across the terminals of a cell.
The energy dissipated in the resistor R comes from the chemical energy of the electrolyte.


The expressions for power, Eqs. (3.32) and (3.33), show the dependence 
of the power dissipated in a resistor R on the current through it and the 
voltage across it. 

Equation (3.33) has an important application to power transmission.
Electrical power is transmitted from power stations to homes and
factories, which may be hundreds of miles away, via transmission cables.
One obviously wants to minimise the power loss in the transmission
cables connecting the power stations to homes and factories. We shall
see now how this can be achieved. Consider a device R, to which a
power P is to be delivered via transmission cables having a resistance $R_c$,
to be dissipated by it finally. If V is the voltage across R and I the current
through it, then

$$P = V I \tag{3.34}$$

The connecting wires from the power station to the device have a finite
resistance $R_c$. The power dissipated in the connecting wires, which is
wasted, is $P_c$ with

$$P_c = I^2 R_c = \frac{P^2 R_c}{V^2} \tag{3.35}$$

from Eq. (3.32). Thus, to drive a device of power P, the power wasted in
the connecting wires is inversely proportional to $V^2$. The transmission
cables from power stations are hundreds of miles long and their
resistance $R_c$ is considerable. To reduce $P_c$, these wires carry current at
enormous values of V and this is the reason for the high voltage danger
signs on transmission lines, a common sight as we move away from
populated areas. Using electricity at such voltages is not safe and hence
at the other end, a device called a transformer lowers the voltage to a
value suitable for use.
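A quick numerical comparison shows how strongly the cable loss of Eq. (3.35) depends on the transmission voltage. The figures below (1 MW delivered, 10 Ω of cable resistance, 11 kV versus 110 kV) are hypothetical values chosen only to illustrate the $1/V^2$ scaling.

```python
def cable_loss(power_delivered_w, voltage_v, cable_resistance_ohm):
    """Power wasted in the cables, P_c = I^2 R_c with I = P / V (Eq. 3.35)."""
    current_a = power_delivered_w / voltage_v
    return current_a**2 * cable_resistance_ohm


# Hypothetical transmission line parameters.
POWER_W = 1.0e6
CABLE_RESISTANCE_OHM = 10.0

loss_at_11kv = cable_loss(POWER_W, 11.0e3, CABLE_RESISTANCE_OHM)
loss_at_110kv = cable_loss(POWER_W, 110.0e3, CABLE_RESISTANCE_OHM)
print(f"loss at 11 kV: {loss_at_11kv:.0f} W, at 110 kV: {loss_at_110kv:.0f} W")
```

Raising the voltage by a factor of 10 reduces the wasted power by a factor of 100 for the same delivered power.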

3.10 Combination of Resistors - Series and Parallel 

The current through a single resistor R across which there is a potential 
difference V is given by Ohm’s law I = V/R. Resistors are sometimes 
joined together and there are simple rules for calculation of equivalent 
resistance of such combination. 

Figure 3.13 A series combination of two resistors $R_1$ and $R_2$.

Two resistors are said to be in series if only one of their end points is
joined (Fig. 3.13). If a third resistor is joined with the series combination
of the two (Fig. 3.14), then all three are said to be in series. Clearly, we
can extend this definition to series combination of any number of
resistors.

Figure 3.14 A series combination of three resistors $R_1$, $R_2$, $R_3$.


Two or more resistors are said to be in parallel if one end of all the 
resistors is joined together and similarly the other ends joined together 
(Fig. 3.15). 










Figure 3.15 Two resistors $R_1$ and $R_2$ connected in parallel.

Consider two resistors $R_1$ and $R_2$ in series. The charge which leaves $R_1$
must be entering $R_2$. Since current measures the rate of flow of charge,
this means that the same current I flows through $R_1$ and $R_2$. By Ohm’s
law:

Potential difference across $R_1$ = $V_1$ = $I R_1$, and
Potential difference across $R_2$ = $V_2$ = $I R_2$.

The potential difference V across the combination is $V_1 + V_2$. Hence,

$$V = V_1 + V_2 = I (R_1 + R_2) \tag{3.36}$$

This is as if the combination had an equivalent resistance $R_{eq}$, which by
Ohm’s law is

$$R_{eq} \equiv \frac{V}{I} = (R_1 + R_2) \tag{3.37}$$

If we had three resistors connected in series, then similarly

$$V = I R_1 + I R_2 + I R_3 = I (R_1 + R_2 + R_3) \tag{3.38}$$

This obviously can be extended to a series combination of any number n
of resistors $R_1, R_2, \ldots, R_n$. The equivalent resistance $R_{eq}$ is

$$R_{eq} = R_1 + R_2 + \ldots + R_n \tag{3.39}$$









Consider now the parallel combination of two resistors (Fig. 3.15). The
charge that flows in at A from the left flows out partly through $R_1$ and
partly through $R_2$. The currents I, $I_1$, $I_2$ shown in the figure are the rates of
flow of charge at the points indicated. Hence,

$$I = I_1 + I_2 \tag{3.40}$$

The potential difference between A and B is given by Ohm’s law
applied to $R_1$

$$V = I_1 R_1 \tag{3.41}$$

Also, Ohm’s law applied to $R_2$ gives

$$V = I_2 R_2 \tag{3.42}$$

$$\therefore\ I = I_1 + I_2 = \frac{V}{R_1} + \frac{V}{R_2} = V \left( \frac{1}{R_1} + \frac{1}{R_2} \right) \tag{3.43}$$

If the combination was replaced by an equivalent resistance $R_{eq}$, we
would have, by Ohm’s law

$$I = \frac{V}{R_{eq}} \tag{3.44}$$

Hence,

$$\frac{1}{R_{eq}} = \frac{1}{R_1} + \frac{1}{R_2} \tag{3.45}$$


We can easily see how this extends to three resistors in parallel
(Fig. 3.16).

Figure 3.16 Parallel combination of three resistors $R_1$, $R_2$ and $R_3$.

Exactly as before

$$I = I_1 + I_2 + I_3 \tag{3.46}$$

and applying Ohm’s law to $R_1$, $R_2$ and $R_3$ we get,

$$V = I_1 R_1,\quad V = I_2 R_2,\quad V = I_3 R_3 \tag{3.47}$$

So that

$$I = I_1 + I_2 + I_3 = V \left( \frac{1}{R_1} + \frac{1}{R_2} + \frac{1}{R_3} \right) \tag{3.48}$$

An equivalent resistance $R_{eq}$ that replaces the combination would be
such that

$$I = \frac{V}{R_{eq}} \tag{3.49}$$

and hence

$$\frac{1}{R_{eq}} = \frac{1}{R_1} + \frac{1}{R_2} + \frac{1}{R_3} \tag{3.50}$$

We can reason similarly for any number of resistors in parallel. The
equivalent resistance of n resistors $R_1, R_2, \ldots, R_n$ is

$$\frac{1}{R_{eq}} = \frac{1}{R_1} + \frac{1}{R_2} + \ldots + \frac{1}{R_n} \tag{3.51}$$















These formulae for equivalent resistances can be used to find out
currents and voltages in more complicated circuits. Consider for example,
the circuit in Fig. 3.17, where there are three resistors $R_1$, $R_2$ and $R_3$.
$R_2$ and $R_3$ are in parallel and hence we can replace them by an
equivalent $R_{eq}^{23}$ between points B and C with

$$\frac{1}{R_{eq}^{23}} = \frac{1}{R_2} + \frac{1}{R_3}$$

or,

$$R_{eq}^{23} = \frac{R_2 R_3}{R_2 + R_3} \tag{3.52}$$

Figure 3.17 A combination of three resistors $R_1$, $R_2$ and $R_3$. $R_2$, $R_3$ are in parallel with an
equivalent resistance $R_{eq}^{23}$. $R_1$ and $R_{eq}^{23}$ are in series with an equivalent resistance $R_{eq}^{123}$.

The circuit now has $R_1$ and $R_{eq}^{23}$ in series and hence their combination
can be replaced by an equivalent resistance $R_{eq}^{123}$ with

$$R_{eq}^{123} = R_{eq}^{23} + R_1 \tag{3.53}$$

If the voltage between A and C is V, the current I is given by

$$I = \frac{V}{R_{eq}^{123}} = \frac{V}{R_1 + [R_2 R_3/(R_2 + R_3)]} = \frac{V (R_2 + R_3)}{R_1 R_2 + R_1 R_3 + R_2 R_3} \tag{3.54}$$
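The reduction of the circuit of Fig. 3.17 can be sketched with two helper functions that implement Eqs. (3.39) and (3.51). The resistor values and the applied voltage below are hypothetical, and the function names are illustrative only.

```python
def series(*resistances_ohm):
    """Equivalent resistance of resistors in series (Eq. 3.39)."""
    return sum(resistances_ohm)


def parallel(*resistances_ohm):
    """Equivalent resistance of resistors in parallel (Eq. 3.51)."""
    return 1.0 / sum(1.0 / r for r in resistances_ohm)


def current_fig_3_17(voltage_v, r1, r2, r3):
    """Current drawn when R1 is in series with the parallel pair R2, R3."""
    return voltage_v / series(r1, parallel(r2, r3))


# Hypothetical values: R1 = 4 ohm, R2 = 6 ohm, R3 = 3 ohm, V = 12 V.
r23 = parallel(6.0, 3.0)
i_total = current_fig_3_17(12.0, 4.0, 6.0, 3.0)
print(f"R23 = {r23:.2f} ohm, I = {i_total:.2f} A")
```

For these values the closed-form expression $V(R_2 + R_3)/(R_1 R_2 + R_1 R_3 + R_2 R_3)$ gives 12 × 9 / 54 = 2 A, in agreement with the step-by-step reduction.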


3.11 Cells, emf, Internal Resistance 

We have already mentioned that a simple device to maintain a steady
current in an electric circuit is the electrolytic cell. Basically a cell has two
electrodes, called the positive (P) and the negative (N), as shown in Fig.
3.18. They are immersed in an electrolytic solution. Dipped in the
solution, the electrodes exchange charges with the electrolyte. The
positive electrode has a potential difference $V_+$ ($V_+$ > 0) between itself
and the electrolyte solution immediately adjacent to it marked A in the
figure. Similarly, the negative electrode develops a negative potential
$-(V_-)$ ($V_-$ > 0) relative to the electrolyte adjacent to it, marked as B in the
figure. When there is no current, the electrolyte has the same potential
throughout, so that the potential difference between P and N is
$V_+ - (-V_-) = V_+ + V_-$. This difference is called the electromotive force (emf) of the
cell and is denoted by $\varepsilon$. Thus

$$\varepsilon = V_+ + V_- > 0 \tag{3.55}$$

Figure 3.18 (a) Sketch of an electrolytic cell with positive terminal P and negative terminal N. The
gap between the electrodes is exaggerated for clarity. A and B are points in the electrolyte
typically close to P and N. (b) The symbol for a cell, + referring to P and - referring to the N
electrode. Electrical connections to the cell are made at P and N.

Note that ε is, actually, a potential difference and not a force. The name
emf, however, is used for historical reasons, and was given at a
time when the phenomenon was not understood properly.

To understand the significance of ε, consider a resistor R connected
across the cell (Fig. 3.18). A current I flows across R from C to D. As
explained before, a steady current is maintained because current flows
from N to P through the electrolyte. Clearly, the same current flows
through the electrolyte, but from N to P, whereas through R it flows
from P to N.





























The electrolyte through which a current flows has a finite resistance r, 
called the internal resistance. Consider first the situation when R is 
infinite so that I = V/R = 0, where V is the potential difference between P 
and N. Now, 

V = Potential difference between P and A
+ Potential difference between A and B
+ Potential difference between B and N
= ε (3.56)

Thus, emf ε is the potential difference between the positive and negative
electrodes in an open circuit, i.e., when no current is flowing through the
cell.

If, however, R is finite, I is not zero. In that case the potential difference
between P and N is

$V = V_+ + V_- - I r = \varepsilon - I r$ (3.57)

Note the negative sign in the expression (I r) for the potential difference
between A and B. This is because the current I flows from B to A in the
electrolyte.

In practical calculations, internal resistances of cells in the circuit may be
neglected when the current I is such that ε >> I r. The actual values of the
internal resistances of cells vary from cell to cell. The internal resistance
of dry cells, however, is much higher than that of the common electrolytic cells.

We also observe that since V is the potential difference across R, we 
have from Ohm’s law 


V = I R (3.58) 



Combining Eqs. (3.57) and (3.58), we get

$I R = \varepsilon - I r$

or, $I = \frac{\varepsilon}{R + r}$ (3.59)

The maximum current that can be drawn from a cell is for R = 0 and it is
$I_{max} = \varepsilon / r$. However, in most cells the maximum allowed current is much
lower than this to prevent permanent damage to the cell.
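Equations (3.57) and (3.59) can be checked numerically. The following is a minimal sketch; the function names and the sample values (a 1.5 V cell with 0.5 Ω internal resistance across a 2.5 Ω load) are illustrative assumptions, not values from the text.

```python
def cell_current(emf, r_internal, r_load):
    """Current through the load, Eq. (3.59): I = emf / (R + r)."""
    return emf / (r_load + r_internal)


def terminal_voltage(emf, r_internal, r_load):
    """Potential difference between P and N, Eq. (3.57): V = emf - I r."""
    current = cell_current(emf, r_internal, r_load)
    return emf - current * r_internal


def max_current(emf, r_internal):
    """Short-circuit current (R = 0): I_max = emf / r."""
    return emf / r_internal


# Hypothetical cell: emf 1.5 V, internal resistance 0.5 ohm, load 2.5 ohm.
print(cell_current(1.5, 0.5, 2.5))      # current in amperes
print(terminal_voltage(1.5, 0.5, 2.5))  # terminal voltage in volts
print(max_current(1.5, 0.5))            # short-circuit current in amperes
```

The terminal voltage always comes out below the emf whenever a current is drawn, which is the content of Eq. (3.57).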


Charges in clouds 

In olden days lightning was considered as an atmospheric flash of 
supernatural origin. It was believed to be the great weapon of Gods. 
But today the phenomenon of lightning can be explained scientifically 
by elementary principles of physics. 

Atmospheric electricity arises due to the separation of electric 
charges. In the ionosphere and magnetosphere strong electric 
current is generated from the solar-terrestrial interaction. In the lower 
atmosphere, the current is weaker and is maintained by 
thunderstorm. 

There are ice particles in the clouds, which grow, collide, fracture and
break apart. The smaller particles acquire positive charge and the
larger ones negative charge. These charged particles get separated
by updrafts in the clouds and gravity. The upper portion of the cloud
becomes positively charged and the middle negatively charged,
leading to a dipole structure. Sometimes a very weak positive charge is
found near the base of the cloud. The ground is positively charged at
the time of thunderstorm development. Also, cosmic and radioactive
radiations ionise air into positive and negative ions and the air
becomes (weakly) electrically conductive. The separation of charges
produces a tremendous amount of electrical potential within the cloud,
as well as between the cloud and the ground. This can amount to
millions of volts; eventually the electrical resistance of the air
breaks down, a lightning flash begins and thousands of amperes of
current flow. The electric field is of the order of $10^5$ V/m. A lightning
flash is composed of a series of strokes with an average of about four
and the duration of each flash is about 30 seconds. The average
peak power per stroke is about $10^{12}$ watts.

During fair weather also there is charge in the atmosphere. The fair
weather electric field arises due to the existence of a surface charge
density at the ground and an atmospheric conductivity, as well as due to
the flow of current from the ionosphere to the earth’s surface, which is
of the order of a picoampere per square metre. The surface charge
density at the ground is negative; the electric field is directed downward.
Over land the average electric field is about 120 V/m, which
corresponds to a surface charge density of
$-1.2 \times 10^{-9}$ C/m$^2$. Over the entire earth’s surface, the total negative
charge amounts to about 600 kC. An equal positive charge exists in
the atmosphere. This electric field is not noticeable in daily life. The
reason why it is not noticed is that virtually everything, including our
bodies, is a conductor compared to air.


Example 3.5 A network of resistors is connected to a 16 V battery
with internal resistance of 1 Ω, as shown in Fig. 3.19: (a) Compute the
equivalent resistance of the network, (b) Obtain the current in each
resistor, (c) Obtain the voltage drops $V_{AB}$, $V_{BC}$ and $V_{CD}$.

Figure 3.19

Solution 

(a) The network is a simple series and parallel combination of
resistors. First the two 4 Ω resistors in parallel are equivalent to a
resistor = [(4 × 4)/(4 + 4)] Ω = 2 Ω.

In the same way, the 12 Ω and 6 Ω resistors in parallel are equivalent
to a resistor of

[(12 × 6)/(12 + 6)] Ω = 4 Ω.

The equivalent resistance R of the network is obtained by combining
these resistors (2 Ω and 4 Ω) with 1 Ω in series,
that is,

R = 2 Ω + 4 Ω + 1 Ω = 7 Ω.

(b) The total current I in the circuit is

$I = \frac{\varepsilon}{R + r} = \frac{16\ \text{V}}{(7 + 1)\ \Omega} = 2\ \text{A}$

Consider the resistors between A and B. If $I_1$ is the current in one of
the 4 Ω resistors and $I_2$ the current in the other,

$I_1 \times 4 = I_2 \times 4$

that is, $I_1 = I_2$, which is otherwise obvious from the symmetry of the
two arms. But $I_1 + I_2 = I = 2$ A. Thus,

$I_1 = I_2 = 1$ A

that is, current in each 4 Ω resistor is 1 A. Current in 1 Ω resistor
between B and C would be 2 A.

Now, consider the resistances between C and D. If $I_3$ is the current in
the 12 Ω resistor, and $I_4$ in the 6 Ω resistor,

$I_3 \times 12 = I_4 \times 6$, i.e., $I_4 = 2 I_3$

But, $I_3 + I_4 = I = 2$ A

Thus, $I_3 = \frac{2}{3}$ A, $I_4 = \frac{4}{3}$ A

that is, the current in the 12 Ω resistor is (2/3) A, while the current in
the 6 Ω resistor is (4/3) A.

(c) The voltage drop across AB is

$V_{AB} = I_1 \times 4 = 1\ \text{A} \times 4\ \Omega = 4\ \text{V}$,

This can also be obtained by multiplying the total current between A
and B by the equivalent resistance between A and B, that is,

$V_{AB} = 2\ \text{A} \times 2\ \Omega = 4\ \text{V}$

The voltage drop across BC is

$V_{BC} = 2\ \text{A} \times 1\ \Omega = 2\ \text{V}$

Finally, the voltage drop across CD is

$V_{CD} = 12\ \Omega \times I_3 = 12\ \Omega \times \frac{2}{3}\ \text{A} = 8\ \text{V}$.

This can alternately be obtained by multiplying total current between
C and D by the equivalent resistance between C and D,
that is,

$V_{CD} = 2\ \text{A} \times 4\ \Omega = 8\ \text{V}$

Note that the total voltage drop across AD is 4 V + 2 V + 8 V = 14 V.
Thus, the terminal voltage of the battery is 14 V, while its emf is 16 V.
The loss of the voltage (= 2 V) is accounted for by the internal
resistance 1 Ω of the battery [2 A × 1 Ω = 2 V].
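The arithmetic of Example 3.5 can be reproduced with exact fractions. This is only a sketch: the helper names `series` and `parallel` are illustrative, and the resistor values are the ones quoted in the solution above.

```python
from fractions import Fraction


def series(*resistances):
    """Equivalent resistance of resistors in series."""
    return sum(Fraction(r) for r in resistances)


def parallel(*resistances):
    """Equivalent resistance of resistors in parallel."""
    return 1 / sum(1 / Fraction(r) for r in resistances)


emf, r_internal = Fraction(16), Fraction(1)

r_ab = parallel(4, 4)      # two 4 ohm resistors between A and B
r_bc = Fraction(1)         # single 1 ohm resistor between B and C
r_cd = parallel(12, 6)     # 12 ohm and 6 ohm resistors between C and D
r_network = series(r_ab, r_bc, r_cd)

total_current = emf / (r_network + r_internal)   # Eq. (3.59)
v_ab = total_current * r_ab
v_bc = total_current * r_bc
v_cd = total_current * r_cd
i_12 = v_cd / 12           # current in the 12 ohm resistor
i_6 = v_cd / 6             # current in the 6 ohm resistor
terminal = emf - total_current * r_internal      # Eq. (3.57)

print(r_network, total_current, v_ab, v_bc, v_cd, i_12, i_6, terminal)
```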


3.12 Cells in Series and in Parallel 

Like resistors, cells can be combined together in an electric circuit. And
like resistors, one can, for calculating currents and voltages in a circuit,
replace a combination of cells by an equivalent cell.

Figure 3.20 Two cells of emfs $\varepsilon_1$ and $\varepsilon_2$ in series. $r_1$, $r_2$ are their internal resistances. For
connections across A and C, the combination can be considered as one cell of emf $\varepsilon_{eq}$ and an
internal resistance $r_{eq}$.

Consider first two cells in series (Fig. 3.20), where one terminal of the two
cells is joined together leaving the other terminal in either cell free.
$\varepsilon_1$, $\varepsilon_2$ are the emfs of the two cells and $r_1$, $r_2$ their internal resistances,
respectively.

Let V(A), V(B), V(C) be the potentials at points A, B and C shown in
Fig. 3.20. Then V(A) - V(B) is the potential difference between the
positive and negative terminals of the first cell. We have already
calculated it in Eq. (3.57) and hence,

$V_{AB} \equiv V(A) - V(B) = \varepsilon_1 - I r_1$ (3.60)

Similarly,

$V_{BC} \equiv V(B) - V(C) = \varepsilon_2 - I r_2$ (3.61)

Hence, the potential difference between the terminals A and C of the
combination is

$V_{AC} \equiv V(A) - V(C) = [V(A) - V(B)] + [V(B) - V(C)] = (\varepsilon_1 + \varepsilon_2) - I (r_1 + r_2)$ (3.62)

If we wish to replace the combination by a single cell between A and C of
emf $\varepsilon_{eq}$ and internal resistance $r_{eq}$, we would have

$V_{AC} = \varepsilon_{eq} - I r_{eq}$ (3.63)

Comparing the last two equations, we get

$\varepsilon_{eq} = \varepsilon_1 + \varepsilon_2$ (3.64)

and $r_{eq} = r_1 + r_2$ (3.65)

In Fig. 3.20, we had connected the negative electrode of the first to the
positive electrode of the second. If instead we connect the two negatives,
Eq. (3.61) would change to $V_{BC} = -\varepsilon_2 - I r_2$ and we will get

$\varepsilon_{eq} = \varepsilon_1 - \varepsilon_2 \quad (\varepsilon_1 > \varepsilon_2)$ (3.66)

The rule for series combination clearly can be extended to any number of 
cells: 

(i) The equivalent emf of a series combination of n cells is just the sum of
their individual emfs, and

(ii) The equivalent internal resistance of a series combination of n cells is
just the sum of their internal resistances.

This is so, when the current leaves each cell from the positive electrode.
If in the combination, the current leaves any cell from the negative
electrode, the emf of the cell enters the expression for $\varepsilon_{eq}$ with a negative
sign, as in Eq. (3.66).



Figure 3.21 Two cells in parallel. For connections across A and C, the combination can be
replaced by one cell of emf $\varepsilon_{eq}$ and internal resistance $r_{eq}$ whose values are given in Eqs. (3.73)
and (3.74).


Next, consider a parallel combination of the cells (Fig. 3.21). $I_1$ and $I_2$ are
the currents leaving the positive electrodes of the cells. At the point $B_1$, $I_1$
and $I_2$ flow in whereas the current I flows out. Since as much charge
flows in as out, we have

$I = I_1 + I_2$ (3.67)

Let $V(B_1)$ and $V(B_2)$ be the potentials at $B_1$ and $B_2$, respectively. Then,
considering the first cell, the potential difference across its terminals is
$V(B_1) - V(B_2)$. Hence, from Eq. (3.57)

$V \equiv V(B_1) - V(B_2) = \varepsilon_1 - I_1 r_1$ (3.68)

Points $B_1$ and $B_2$ are connected exactly similarly to the second cell.
Hence considering the second cell, we also have

$V \equiv V(B_1) - V(B_2) = \varepsilon_2 - I_2 r_2$ (3.69)

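Combining the last three equations,

$I = I_1 + I_2 = \frac{\varepsilon_1 - V}{r_1} + \frac{\varepsilon_2 - V}{r_2} = \left(\frac{\varepsilon_1}{r_1} + \frac{\varepsilon_2}{r_2}\right) - V \left(\frac{1}{r_1} + \frac{1}{r_2}\right)$ (3.70)

Hence, V is given by

$V = \frac{\varepsilon_1 r_2 + \varepsilon_2 r_1}{r_1 + r_2} - I \frac{r_1 r_2}{r_1 + r_2}$ (3.71)

If we want to replace the combination by a single cell, between $B_1$ and $B_2$, of
emf $\varepsilon_{eq}$ and internal resistance $r_{eq}$, we would have

$V = \varepsilon_{eq} - I r_{eq}$ (3.72)

The last two equations should be the same and hence

$\varepsilon_{eq} = \frac{\varepsilon_1 r_2 + \varepsilon_2 r_1}{r_1 + r_2}$ (3.73)

$r_{eq} = \frac{r_1 r_2}{r_1 + r_2}$ (3.74)

We can put these equations in a simpler way,

$\frac{1}{r_{eq}} = \frac{1}{r_1} + \frac{1}{r_2}$ (3.75)

$\frac{\varepsilon_{eq}}{r_{eq}} = \frac{\varepsilon_1}{r_1} + \frac{\varepsilon_2}{r_2}$ (3.76)

In Fig. 3.21, we had joined the positive terminals together and similarly
the two negative ones, so that the currents $I_1$, $I_2$ flow out of positive
terminals. If the negative terminal of the second is connected to positive
terminal of the first, Eqs. (3.75) and (3.76) would still be valid with
$\varepsilon_2 \to -\varepsilon_2$.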



Gustav Robert Kirchhoff (1824 - 1887) German physicist,
professor at Heidelberg and at Berlin. Mainly known for his
development of spectroscopy, he also made many important
contributions to mathematical physics, among them, his first and
second rules for circuits.


Equations (3.75) and (3.76) can be extended easily. If there are n cells of
emf $\varepsilon_1, \ldots, \varepsilon_n$ and of internal resistances $r_1, \ldots, r_n$ respectively, connected
in parallel, the combination is equivalent to a single cell of emf $\varepsilon_{eq}$ and
internal resistance $r_{eq}$, such that

$\frac{1}{r_{eq}} = \frac{1}{r_1} + \ldots + \frac{1}{r_n}$ (3.77)

$\frac{\varepsilon_{eq}}{r_{eq}} = \frac{\varepsilon_1}{r_1} + \ldots + \frac{\varepsilon_n}{r_n}$ (3.78)
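The series rules (i), (ii) and the parallel rules of Eqs. (3.77) and (3.78) can be sketched in a few lines. The function names and the two sample cells (2 V with 1 Ω, 1 V with 2 Ω) are illustrative assumptions; a reversed cell in series is represented by a negative emf, as in Eq. (3.66).

```python
from fractions import Fraction


def series_cells(cells):
    """cells: list of (emf, internal_resistance). Returns (emf_eq, r_eq).

    A cell whose negative electrode the current leaves is entered with a
    negative emf, following Eq. (3.66).
    """
    emf_eq = sum(Fraction(e) for e, _ in cells)
    r_eq = sum(Fraction(r) for _, r in cells)
    return emf_eq, r_eq


def parallel_cells(cells):
    """Equivalent cell for a parallel combination, Eqs. (3.77) and (3.78)."""
    r_eq = 1 / sum(1 / Fraction(r) for _, r in cells)
    emf_eq = r_eq * sum(Fraction(e) / Fraction(r) for e, r in cells)
    return emf_eq, r_eq


cells = [(2, 1), (1, 2)]          # hypothetical cells: (emf in V, r in ohm)
print(series_cells(cells))         # emf and resistance simply add
print(parallel_cells(cells))       # agrees with Eqs. (3.73) and (3.74)
```

For the two sample cells the parallel result matches Eq. (3.73): (2 × 2 + 1 × 1)/(1 + 2) = 5/3 V.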

3.13 Kirchhoff’s Rules

Electric circuits generally consist of a number of resistors and cells 
interconnected sometimes in a complicated way. The formulae we have 
derived earlier for series and parallel combinations of resistors are not 
always sufficient to determine all the currents and potential differences in 
the circuit. Two rules, called Kirchhoff’s rules, are very useful for analysis 
of electric circuits. 

Given a circuit, we start by labelling currents in each resistor by a symbol,
say I, and a directed arrow to indicate that a current I flows along the
resistor in the direction indicated. If ultimately I is determined to be
positive, the actual current in the resistor is in the direction of the arrow. If
I turns out to be negative, the current actually flows in a direction
opposite to the arrow. Similarly, for each source (i.e., cell or some other
source of electrical power) the positive and negative electrodes are
labelled, as well as a directed arrow with a symbol for the current flowing
through the cell. This will tell us the potential difference, V = V(P) - V(N)
= ε - I r [Eq. (3.57)], between the positive terminal P and the negative
terminal N; I here is the current flowing from N to P through the cell. If,
while labelling the current I through the cell, one goes from P to N, then of
course

$V = \varepsilon + I r$ (3.79)

Having clarified labelling, we now state the rules and the proof: 





(a) Junction rule: At any junction, the sum of the currents entering the 
junction is equal to the sum of currents leaving the junction (Fig. 3.22). 

This applies equally well if instead of a junction of several lines, we 
consider a point in a line. 

The proof of this rule follows from the fact that when currents are steady,
there is no accumulation of charges at any junction or at any point in a
line. Thus, the total current flowing in (which is the rate at which charge
flows into the junction) must equal the total current flowing out.

Figure 3.22 At junction a the current leaving is $I_1 + I_2$ and current entering is $I_3$. The junction rule
says $I_3 = I_1 + I_2$. At point h current entering is $I_1$. There is only one current leaving h and by
junction rule that will also be $I_1$. For the loops ‘ahdcba’ and ‘ahdefga’, the loop rules give
$-30 I_1 - 41 I_3 + 45 = 0$ and $-30 I_1 + 21 I_2 - 80 = 0$.

(b) Loop rule: The algebraic sum of changes in potential around any 
closed loop involving resistors and cells in the loop is zero (Fig. 3.22). 

This rule is also obvious, since electric potential is dependent on the 
location of the point. Thus starting with any point if we come back to the 
same point, the total change must be zero. In a closed loop, we do come 
back to the starting point and hence the rule. 















Example 3.6 A battery of 10 V and negligible internal resistance is 
connected across the diagonally opposite corners of a cubical 
network consisting of 12 resistors each of resistance 1 Ω (Fig. 3.23).
Determine the equivalent resistance of the network and the current 
along each edge of the cube. 



Figure 3.23 

Solution The network is not reducible to a simple series and parallel 
combinations of resistors. There is, however, a clear symmetry in the 
problem which we can exploit to obtain the equivalent resistance of 
the network. 

The paths AA', AD and AB are obviously symmetrically placed in the
network. Thus, the current in each must be the same, say, I. Further,
at the corners A', B and D, the incoming current I must split equally
into the two outgoing branches. In this manner, the current in all the
12 edges of the cube is easily written down in terms of I, using
Kirchhoff’s first rule and the symmetry in the problem.

Next take a closed loop, say, ABCC'EA, and apply Kirchhoff’s second
rule:

$-IR - \frac{1}{2} IR - IR + \varepsilon = 0$

where R is the resistance of each edge and ε the emf of battery.
Thus,

$\varepsilon = \frac{5}{2} I R$

The equivalent resistance $R_{eq}$ of the network is

$R_{eq} = \frac{\varepsilon}{3I} = \frac{5}{6} R$

For R = 1 Ω, $R_{eq}$ = (5/6) Ω and for ε = 10 V, the total current (= 3I) in
the network is

3I = 10 V/(5/6) Ω = 12 A, i.e., I = 4 A

The current flowing in each edge can now be read off from the
Fig. 3.23.
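The symmetry argument of Example 3.6 reduces to a short calculation, sketched below. The function name is hypothetical, and the code simply encodes the loop equation above (drops of IR, IR/2 and IR along one path from A to C'); it is not a general network solver.

```python
from fractions import Fraction


def cube_across_body_diagonal(emf, r_edge):
    """Return (I, total current, R_eq) for a cube of equal resistors
    fed across a body diagonal, using the loop rule along one path."""
    # Sum of potential drops per unit of I: R + R/2 + R = (5/2) R.
    drop_per_unit_current = r_edge + Fraction(1, 2) * r_edge + r_edge
    i_edge = emf / drop_per_unit_current   # current I in each edge leaving A
    total_current = 3 * i_edge             # three such edges meet at A
    r_eq = emf / total_current
    return i_edge, total_current, r_eq


i_edge, total_current, r_eq = cube_across_body_diagonal(10, 1)
print(i_edge, total_current, r_eq)
```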


Simulation for application of Kirchhoff’s rules:

http://www.phys.hawaii.edu/~teb/optics/java/kirch3/




It should be noted that because of the symmetry of the network, the great 
power of Kirchhoff’s rules has not been very apparent in Example 3.6. In 
a general network, there will be no such simplification due to symmetry, 
and only by application of Kirchhoff’s rules to junctions and closed loops 
(as many as necessary to solve the unknowns in the network) can we 
handle the problem. This will be illustrated in Example 3.7. 


Example 3.7 Determine the current in each branch of the network 
shown in Fig. 3.24. 


Figure 3.24


Solution Each branch of the network is assigned an unknown current
to be determined by the application of Kirchhoff’s rules. To reduce the
number of unknowns at the outset, the first rule of Kirchhoff is used at
every junction to assign the unknown current in each branch. We
then have three unknowns $I_1$, $I_2$ and $I_3$ which can be found by
applying the second rule of Kirchhoff to three different closed loops.
Kirchhoff’s second rule for the closed loop ADCA gives,

$10 - 4(I_1 - I_2) + 2(I_2 + I_3 - I_1) - I_1 = 0$ [3.80(a)]

that is, $7 I_1 - 6 I_2 - 2 I_3 = 10$

For the closed loop ABCA, we get

$10 - 4 I_2 - 2(I_2 + I_3) - I_1 = 0$

that is, $I_1 + 6 I_2 + 2 I_3 = 10$ [3.80(b)]

For the closed loop BCDEB, we get

$5 - 2(I_2 + I_3) - 2(I_2 + I_3 - I_1) = 0$

that is, $2 I_1 - 4 I_2 - 4 I_3 = -5$ [3.80(c)]

Equations (3.80 a, b, c) are three simultaneous equations in three
unknowns. These can be solved by the usual method to give

$I_1 = 2.5$ A, $I_2 = \frac{5}{8}$ A, $I_3 = 1\frac{7}{8}$ A

The currents in the various branches of the network are

AB : $\frac{5}{8}$ A, CA : $2\frac{1}{2}$ A, DEB : $1\frac{7}{8}$ A

AD : $1\frac{7}{8}$ A, CD : 0 A, BC : $2\frac{1}{2}$ A

It is easily verified that Kirchhoff’s second rule applied to the
remaining closed loops does not provide any additional independent
equation, that is, the above values of currents satisfy the second rule
for every closed loop of the network. For example, the total voltage
drop over the closed loop BADEB

$5\ \text{V} + \left(\frac{5}{8} \times 4\right) \text{V} - \left(\frac{15}{8} \times 4\right) \text{V}$

is equal to zero, as required by Kirchhoff’s second rule.
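The "usual method" for Eqs. (3.80 a, b, c) can be carried out mechanically. The sketch below uses a small Gauss-Jordan elimination over exact fractions; the helper `solve_linear` is an illustrative stand-in for any linear solver, and the branch labels follow the list of branch currents given in the solution.

```python
from fractions import Fraction


def solve_linear(a, b):
    """Solve a x = b by Gauss-Jordan elimination with exact fractions."""
    n = len(b)
    # Augmented matrix [a | b].
    m = [[Fraction(v) for v in row] + [Fraction(rhs)] for row, rhs in zip(a, b)]
    for col in range(n):
        # Find a row with a non-zero pivot and move it into place.
        pivot = next(r for r in range(col, n) if m[r][col] != 0)
        m[col], m[pivot] = m[pivot], m[col]
        m[col] = [v / m[col][col] for v in m[col]]
        # Eliminate this column from every other row.
        for r in range(n):
            if r != col:
                factor = m[r][col]
                m[r] = [v - factor * p for v, p in zip(m[r], m[col])]
    return [row[n] for row in m]


# Coefficients of I1, I2, I3 in Eqs. 3.80(a), (b), (c).
coeffs = [[7, -6, -2],
          [1, 6, 2],
          [2, -4, -4]]
rhs = [10, 10, -5]
i1, i2, i3 = solve_linear(coeffs, rhs)

branch_ad = i1 - i2            # current in AD
branch_cd = i2 + i3 - i1       # current in CD
branch_bc = i2 + i3            # current in BC
print(i1, i2, i3, branch_ad, branch_cd, branch_bc)
```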




3.14 Wheatstone Bridge 

As an application of Kirchhoff’s rules consider the circuit shown in
Fig. 3.25, which is called the Wheatstone bridge. The bridge has
four resistors $R_1$, $R_2$, $R_3$ and $R_4$. Across one pair of diagonally opposite
points (A and C in the figure) a source is connected. This (i.e., AC) is
called the battery arm. Between the other two vertices, B and D, a
galvanometer G (which is a device to detect currents) is connected. This
line, shown as BD in the figure, is called the galvanometer arm.

For simplicity, we assume that the cell has no internal resistance. In
general there will be currents flowing across all the resistors as well as a
current $I_g$ through G. Of special interest is the case of a balanced bridge
where the resistors are such that $I_g = 0$. We can easily get the balance
condition, such that there is no current through G. In this case,
Kirchhoff’s junction rule applied to junctions D and B (see the figure)
immediately gives us the relations $I_1 = I_3$ and $I_2 = I_4$. Next, we apply
Kirchhoff’s loop rule to closed loops ADBA and CBDC. The first loop
gives

$-I_1 R_1 + 0 + I_2 R_2 = 0 \quad (I_g = 0)$ (3.81)

and the second loop gives, upon using $I_3 = I_1$, $I_4 = I_2$

$I_2 R_4 + 0 - I_1 R_3 = 0$ (3.82)

From Eq. (3.81), we obtain,

$\frac{I_1}{I_2} = \frac{R_2}{R_1}$

whereas from Eq. (3.82), we obtain,

$\frac{I_1}{I_2} = \frac{R_4}{R_3}$

Hence, we obtain the condition

$\frac{R_2}{R_1} = \frac{R_4}{R_3}$ [3.83(a)]

This last equation relating the four resistors is called the balance
condition for the galvanometer to give zero or null deflection.

Figure 3.25


The Wheatstone bridge and its balance condition provide a practical
method for determination of an unknown resistance. Let us suppose we
have an unknown resistance, which we insert in the fourth arm; $R_4$ is thus
not known. Keeping known resistances $R_1$ and $R_2$ in the first and second
arm of the bridge, we go on varying $R_3$ till the galvanometer shows a null
deflection. The bridge then is balanced, and from the balance condition
the value of the unknown resistance $R_4$ is given by,

$R_4 = R_3 \frac{R_2}{R_1}$ [3.83(b)]


A practical device using this principle is called the meter bridge. It will be 
discussed in the next section. 
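The balance condition and its use for finding an unknown resistance can be sketched as follows. The function names, the tolerance and the sample resistances are assumptions made for illustration; the relations themselves are Eqs. [3.83(a)] and [3.83(b)].

```python
def is_balanced(r1, r2, r3, r4, rel_tol=1e-9):
    """True when R2/R1 = R4/R3, tested in cross-multiplied form."""
    lhs, rhs = r2 * r3, r1 * r4
    return abs(lhs - rhs) <= rel_tol * max(abs(lhs), abs(rhs), 1.0)


def unknown_r4(r1, r2, r3):
    """Unknown fourth arm at balance, Eq. [3.83(b)]: R4 = R3 R2 / R1."""
    return r3 * r2 / r1


# Hypothetical bridge: R1 = 100 ohm, R2 = 200 ohm, R3 adjusted to 50 ohm.
r4 = unknown_r4(100.0, 200.0, 50.0)
print(r4, is_balanced(100.0, 200.0, 50.0, r4))
```

Testing the cross-multiplied form avoids dividing by an arm that happens to be set to zero.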


Example 3.8 The four arms of a Wheatstone bridge (Fig. 3.26) have
the following resistances:

AB = 100 Ω, BC = 10 Ω, CD = 5 Ω and DA = 60 Ω.

Figure 3.26

A galvanometer of 15 Ω resistance is connected across BD. Calculate
the current through the galvanometer when a potential difference of
10 V is maintained across AC.


Solution Considering the mesh BADB, we have

$100 I_1 + 15 I_g - 60 I_2 = 0$

or $20 I_1 + 3 I_g - 12 I_2 = 0$ [3.84(a)]

Considering the mesh BCDB, we have

$10 (I_1 - I_g) - 15 I_g - 5 (I_2 + I_g) = 0$

$10 I_1 - 30 I_g - 5 I_2 = 0$

$2 I_1 - 6 I_g - I_2 = 0$ [3.84(b)]

Considering the mesh ADCEA,

$60 I_2 + 5 (I_2 + I_g) = 10$

$65 I_2 + 5 I_g = 10$

$13 I_2 + I_g = 2$ [3.84(c)]

Multiplying Eq. (3.84b) by 10

$20 I_1 - 60 I_g - 10 I_2 = 0$ [3.84(d)]

From Eqs. (3.84d) and (3.84a) we have

$63 I_g - 2 I_2 = 0$

$I_2 = 31.5 I_g$ [3.84(e)]

Substituting the value of $I_2$ into Eq. [3.84(c)], we get

$13 (31.5 I_g) + I_g = 2$

$410.5 I_g = 2$

$I_g = 4.87$ mA.
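The elimination steps of Example 3.8 can be retraced with exact fractions as a check. This sketch follows the same substitutions as the solution above; the variable names are illustrative.

```python
from fractions import Fraction

# Mesh equations from the solution:
#   20 I1 + 3 Ig - 12 I2 = 0    [3.84(a)]
#    2 I1 - 6 Ig -    I2 = 0    [3.84(b)]
#   13 I2 +   Ig         = 2    [3.84(c)]

# (a) minus 10 x (b) eliminates I1: 63 Ig - 2 I2 = 0, so I2 = (63/2) Ig.
ratio = Fraction(63, 2)

# Substitute into (c): 13 (63/2) Ig + Ig = 2.
ig = Fraction(2) / (13 * ratio + 1)
i2 = ratio * ig
i1 = (6 * ig + i2) / 2          # rearranged from (b)

print(float(ig) * 1000, "mA")   # galvanometer current in milliamperes
```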


3.15 Meter Bridge

The meter bridge is shown in Fig. 3.27. It consists of a wire of length 1 m
and of uniform cross sectional area stretched taut and clamped between
two thick metallic strips bent at right angles, as shown. The metallic strip
has two gaps across which resistors can be connected. The end points
where the wire is clamped are connected to a cell through a key. One
end of a galvanometer is connected to the metallic strip midway between
the two gaps. The other end of the galvanometer is connected to a
‘jockey’. The jockey is essentially a metallic rod whose one end has a
knife-edge which can slide over the wire to make electrical connection.

Figure 3.27 A meter bridge. Wire AC is 1 m long. R is a resistance to be measured and S is a
standard resistance.

R is an unknown resistance whose value we want to determine. It is
connected across one of the gaps. Across the other gap, we connect a
standard known resistance S. The jockey is connected to some point D
on the wire, a distance l cm from the end A. The jockey can be moved
along the wire. The portion AD of the wire has a resistance $R_{cm} l$, where
$R_{cm}$ is the resistance of the wire per unit centimetre. The portion DC of
the wire similarly has a resistance $R_{cm} (100 - l)$.

The four arms AB, BC, DA and CD [with resistances R, S, $R_{cm} l$ and
$R_{cm} (100 - l)$] obviously form a Wheatstone bridge with AC as the battery
arm and BD the galvanometer arm. If the jockey is moved along the wire,
then there will be one position where the galvanometer will show no
current. Let the distance of the jockey from the end A at the balance point
be $l = l_1$. The four resistances of the bridge at the balance point then are
R, S, $R_{cm} l_1$ and $R_{cm} (100 - l_1)$. The balance condition, Eq. [3.83(a)] gives

$\frac{R}{S} = \frac{R_{cm} l_1}{R_{cm} (100 - l_1)} = \frac{l_1}{100 - l_1}$ (3.85)

Thus, once we have found out $l_1$, the unknown resistance R is known in
terms of the standard known resistance S by

$R = S \frac{l_1}{100 - l_1}$ (3.86)

By choosing various values of S, we would get various values of $l_1$ and
calculate R each time. An error in measurement of $l_1$ would naturally
result in an error in R. It can be shown that the percentage error in R can
be minimised by adjusting the balance point near the middle of the
bridge, i.e., when $l_1$ is close to 50 cm. (This requires a suitable choice
of S.)
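Equation (3.86) and the remark about the percentage error can be illustrated with a short sketch. The relative-error expression used here, ΔR/R = 100 Δl / [l₁ (100 − l₁)], is obtained by differentiating Eq. (3.86) with respect to l₁; it is a derivation added for illustration rather than a formula quoted from the text, and the function names and sample readings are likewise assumptions.

```python
def unknown_resistance(s, l1, length=100.0):
    """Eq. (3.86): R = S l1 / (100 - l1), with l1 in cm."""
    return s * l1 / (length - l1)


def relative_error(l1, dl, length=100.0):
    """Fractional error in R for a small reading error dl in l1.

    From R = S l1/(L - l1): dR/R = dl/l1 + dl/(L - l1) = L dl / (l1 (L - l1)).
    """
    return length * dl / (l1 * (length - l1))


print(unknown_resistance(10.0, 50.0))   # balance at mid-point gives R = S
for l1 in (10.0, 30.0, 50.0, 70.0, 90.0):
    # The fractional error is smallest near l1 = 50 cm.
    print(l1, relative_error(l1, 0.1))
```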


Example 3.9 In a meter bridge (Fig. 3.27), the null point is found at a
distance of 33.7 cm from A. If now a resistance of 12 Ω is connected
in parallel with S, the null point occurs at 51.9 cm. Determine the
values of R and S.

Solution From the first balance point, we get

$\frac{R}{S} = \frac{33.7}{66.3}$ (3.87)

After S is connected in parallel with a resistance of 12 Ω, the
resistance across the gap changes from S to $S_{eq}$, where

$S_{eq} = \frac{12 S}{S + 12}$

and hence the new balance condition now gives

$\frac{51.9}{48.1} = \frac{R}{S_{eq}} = \frac{R (S + 12)}{12 S}$ (3.88)

Substituting the value of R/S from Eq. (3.87), we get

$\frac{51.9}{48.1} = \frac{S + 12}{12} \cdot \frac{33.7}{66.3}$

which gives S = 13.5 Ω. Using the value of R/S above, we get
R = 6.86 Ω.
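The two balance conditions of Example 3.9 can be solved directly, as sketched below with illustrative variable names. Working with unrounded intermediate values gives S close to 13.47 Ω and R close to 6.85 Ω; the quoted R = 6.86 Ω corresponds to using the rounded S = 13.5 Ω.

```python
# Balance-point ratios l1 / (100 - l1) for the two measurements.
ratio_first = 33.7 / 66.3      # equals R / S, Eq. (3.87)
ratio_second = 51.9 / 48.1     # equals R / S_eq, Eq. (3.88)
shunt = 12.0                   # resistance placed in parallel with S (ohm)

# From Eq. (3.88): ratio_second = ratio_first * (S + shunt) / shunt.
s = shunt * (ratio_second / ratio_first - 1.0)
r = ratio_first * s

print(round(s, 2), round(r, 2))
```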


3.16 Potentiometer 

This is a versatile instrument. It is basically a long piece of uniform wire,
sometimes a few metres in length, across which a standard cell (B) is
connected. In actual design, the wire is sometimes cut in several pieces
placed side by side and connected at the ends by thick metal strips
(Fig. 3.28). In the figure, the wires run from A to C. The small vertical
portions are the thick metal strips connecting the various sections of
the wire.

A current I flows through the wire which can be varied by a variable
resistance (rheostat, R) in the circuit. Since the wire is uniform, the
potential difference between A and any point at a distance l from A is

$\varepsilon(l) = \phi\, l$ (3.89)

where φ is the potential drop per unit length.

Figure 3.28 (a) shows an application of the potentiometer to compare the
emfs of two cells, $\varepsilon_1$ and $\varepsilon_2$. The points marked 1, 2, 3 form a two
way key. Consider first a position of the key where 1 and 3 are connected
so that the galvanometer is connected to $\varepsilon_1$. The jockey is moved along
the wire till at a point $N_1$, at a distance $l_1$ from A, there is no deflection in
the galvanometer. We can apply Kirchhoff’s loop rule to the closed loop
$AN_1G31A$ and get,

$\phi\, l_1 + 0 - \varepsilon_1 = 0$ (3.90)

Similarly, if another emf $\varepsilon_2$ is balanced against $l_2$ ($AN_2$)

$\phi\, l_2 + 0 - \varepsilon_2 = 0$ (3.91)

From the last two equations

$\frac{\varepsilon_1}{\varepsilon_2} = \frac{l_1}{l_2}$ (3.92)

Figure 3.28 A potentiometer. G is a galvanometer and R a variable resistance (rheostat). 1, 2, 3
are terminals of a two way key. (a) Circuit for comparing emfs of two cells; (b) circuit for
determining internal resistance of a cell.

This simple mechanism thus allows one to compare the emfs of any two
sources ($\varepsilon_1$, $\varepsilon_2$). In practice one of the cells is chosen as a standard cell
whose emf is known to a high degree of accuracy. The emf of the other
cell is then easily calculated from Eq. (3.92).






























We can also use a potentiometer to measure internal resistance of a cell
[Fig. 3.28 (b)]. For this the cell (emf ε) whose internal resistance (r) is to
be determined is connected across a resistance box through a key $K_2$, as
shown in the figure. With key $K_2$ open, balance is obtained at length $l_1$
($AN_1$). Then,

$\varepsilon = \phi\, l_1$ [3.93(a)]

When key $K_2$ is closed, the cell sends a current (I) through the resistance
box (R). If V is the terminal potential difference of the cell and balance is
obtained at length $l_2$ ($AN_2$),

$V = \phi\, l_2$ [3.93(b)]

So, we have $\varepsilon / V = l_1 / l_2$ [3.94(a)]

But, ε = I (r + R) and V = IR. This gives

$\varepsilon / V = (r + R)/R$ [3.94(b)]

From Eq. [3.94(a)] and [3.94(b)] we have

$(R + r)/R = l_1 / l_2$

$r = R \left( \frac{l_1}{l_2} - 1 \right)$ (3.95)


Using Eq. (3.95) we can find the internal resistance of a given cell. 

The potentiometer has the advantage that it draws no current from the 
voltage source being measured. As such it is unaffected by the internal 
resistance of the source. 
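Both potentiometer measurements reduce to ratios of balance lengths, Eqs. (3.92) and (3.95). The sketch below uses hypothetical function names and made-up readings purely to show how the formulas are applied.

```python
def emf_from_standard(emf_standard, l_standard, l_unknown):
    """Eq. (3.92): the emfs are in the ratio of the balance lengths."""
    return emf_standard * l_unknown / l_standard


def internal_resistance(r_box, l_open, l_closed):
    """Eq. (3.95): r = R (l1/l2 - 1), with l1 for key open, l2 for key closed."""
    return r_box * (l_open / l_closed - 1.0)


# Hypothetical readings: a 1.02 V standard cell balances at 51 cm and the
# unknown cell at 75 cm; with an 8 ohm box the balance moves from 80 cm to 64 cm.
print(emf_from_standard(1.02, 51.0, 75.0))
print(internal_resistance(8.0, 80.0, 64.0))
```

Since only length ratios enter, the potential drop per unit length φ never needs to be known.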

Example 3.10 A resistance of R Ω draws current from a
potentiometer. The potentiometer has a total resistance $R_0$ Ω
(Fig. 3.29). A voltage V is supplied to the potentiometer. Derive an
expression for the voltage across R when the sliding contact is in the
middle of the potentiometer.

Figure 3.29

Solution While the slide is in the middle of the potentiometer only
half of its resistance ($R_0/2$) will be between the points A and B.
Hence, the total resistance between A and B, say, $R_1$ will be given by
the following expression:

$\frac{1}{R_1} = \frac{1}{R} + \frac{1}{(R_0/2)}$

$R_1 = \frac{R_0 R}{R_0 + 2R}$

The total resistance between A and C will be sum of resistance
between A and B and B and C, i.e., $R_1 + R_0/2$

The current flowing through the potentiometer will be

$I = \frac{V}{R_1 + R_0/2} = \frac{2V}{2R_1 + R_0}$


















The voltage $V_1$ taken from the potentiometer will be the product of
current I and resistance $R_1$,

$V_1 = I R_1 = \left( \frac{2V}{2R_1 + R_0} \right) \times R_1$

Substituting for $R_1$, we have

$V_1 = \frac{2V}{2 \left( \frac{R_0 R}{R_0 + 2R} \right) + R_0} \times \frac{R_0 R}{R_0 + 2R}$

$V_1 = \frac{2VR}{2R + R_0 + 2R}$ or $V_1 = \frac{2VR}{R_0 + 4R}$
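The result of Example 3.10 can be cross-checked by following the solution step by step and comparing with the closed form 2VR/(R₀ + 4R) that the substitution leads to. The function name and the sample values (V = 10 V, R₀ = 100 Ω, R = 50 Ω) are assumptions for illustration.

```python
from fractions import Fraction


def loaded_midpoint_voltage(v, r0, r):
    """Voltage across load r with the slider at the middle of a
    potentiometer of total resistance r0 supplied with voltage v."""
    half = Fraction(r0) / 2
    r1 = half * r / (half + r)      # r in parallel with R0/2 (between A and B)
    current = v / (r1 + half)       # total resistance between A and C is R1 + R0/2
    return current * r1


v1 = loaded_midpoint_voltage(10, 100, 50)
closed_form = Fraction(2 * 10 * 50, 100 + 4 * 50)   # 2VR / (R0 + 4R)
print(v1, closed_form)
```

The loaded output is less than V/2, the value an unloaded mid-point tap would give.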
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Summary 

1. Current through a given area of a conductor is the net charge 
passing per unit time through the area. 

2. To maintain a steady current, we must have a closed circuit in 
which an external agency moves electric charge from lower to higher 
potential energy. The work done per unit charge by the source in 
taking the charge from lower to higher potential energy (i.e., from one 
terminal of the source to the other) is called the electromotive force, 
or emf, of the source. Note that the emf is not a force; it is the voltage 
difference between the two terminals of a source in open circuit. 

3. Ohm’s law: The electric current I flowing through a substance is 
proportional to the voltage V across its ends, i.e., V « | or V = Rl, 
















where R is called the resistance of the substance. The unit of 
resistance is ohm: IQ = 1 V A -1 . 

4. The resistance R of a conductor depends on its length I and 
cross-sectional area A through the relation, 

A 

where p, called resistivity is a property of the material and depends 
on temperature and pressure. 

5. Electrical resistivity of substances varies over a very wide range. 
Metals have low resistivity, in the range of 10 -8 Q m to 10 -6 Q m. 

Insulators like glass and rubber have 10 22 to 10 24 times greater 
resistivity. Semiconductors like Si and Ge lie roughly in the middle 
range of resistivity on a logarithmic scale. 

6. In most substances, the carriers of current are electrons; in some 
cases, for example, ionic crystals and electrolytic liquids, positive and 
negative ions carry the electric current. 

7. Current density j gives the amount of charge flowing per second 
per unit area normal to the flow, 

j = nq v d 

where n is the number density (number per unit volume) of charge 
carriers each of charge q, and v d is the drift velocity of the charge 

carriers. For electrons q = - e. If j is normal to a cross-sectional area 
A and is constant over the area, the magnitude of the current I 
through the area is nev d A. 

8. Using E = V/l, I = nev d A, and Ohm’s law, one obtains 




eE rur 

-= P - P-i 

rn m 

The proportionality between the force eE on the electrons in a metal 
due to the external field E and the drift velocity v d (not acceleration) 

can be understood, if we assume that the electrons suffer collisions 
with ions in the metal, which deflect them randomly. If such collisions 
occur on an average at a time interval t, 

v d = aT = eET/m 

where a is the acceleration of the electron. This gives 

m 

, 

ncr r 

9. In the temperature range in which resistivity increases linearly with 
temperature, the temperature coefficient of resistivity a is defined as 
the fractional increase in resistivity per unit increase in temperature. 

10. Ohm’s law is obeyed by many substances, but it is not a 
fundamental law of nature. It fails if 

(a) V depends on I non-linearly. 

(b) the relation between V and I depends on the sign of V for the 
same absolute value of V. 

(c) The relation between V and I is non-unique. 

An example of (a) is when p increases with I (even if temperature is 
kept fixed). A rectifier combines features (a) and (b). GaAs shows the 
feature (c). 

11. When a source of emf £ is connected to an external resistance R, 
the voltage V ext across R is given by 






where r is the internal resistance of the source. 

12. (a) Total resistance R of n resistors connected in series is given 
by 

$R = R_1 + R_2 + \dots + R_n$ 

(b) Total resistance R of n resistors connected in parallel is given by 

$\dfrac{1}{R} = \dfrac{1}{R_1} + \dfrac{1}{R_2} + \dots + \dfrac{1}{R_n}$ 
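The two combination rules can be sketched as short helper functions. The names `series` and `parallel` and the resistor values are illustrative assumptions.

```python
def series(*resistances):
    """Equivalent resistance of resistors in series: R = R1 + R2 + ... + Rn."""
    return sum(resistances)


def parallel(*resistances):
    """Equivalent resistance in parallel: 1/R = 1/R1 + 1/R2 + ... + 1/Rn."""
    return 1.0 / sum(1.0 / r for r in resistances)


# Illustrative values in ohms (assumed).
r_series = series(10.0, 20.0, 30.0)
r_parallel = parallel(10.0, 20.0, 20.0)
print(f"Series: {r_series:.1f} ohm, parallel: {r_parallel:.1f} ohm")
```

A parallel combination always comes out smaller than its smallest member, while a series combination is larger than its largest member.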

13. Kirchhoff’s Rules - 

(a) Junction Rule: At any junction of circuit elements, the sum of 
currents entering the junction must equal the sum of currents leaving 
it. 

(b) Loop Rule: The algebraic sum of changes in potential around any 
closed loop must be zero. 

14. The Wheatstone bridge is an arrangement of four resistances 
$R_1$, $R_2$, $R_3$, $R_4$ as shown in the text. The null-point condition is given 
by 

$\dfrac{R_1}{R_2} = \dfrac{R_3}{R_4}$ 

using which the value of one resistance can be determined, knowing 
the other three resistances. 
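A minimal sketch of using the balance condition $R_1/R_2 = R_3/R_4$ to find an unknown fourth resistance. It assumes that $R_1$ and $R_2$ form one series arm and $R_3$ and $R_4$ the other, both across the same cell; the function names and values are hypothetical.

```python
import math


def unknown_resistance(r1, r2, r3):
    """Solve R1/R2 = R3/R4 for R4."""
    return r2 * r3 / r1


def is_balanced(r1, r2, r3, r4):
    """True when both arms divide the cell voltage in the same ratio,
    so that no current flows through the galvanometer."""
    return math.isclose(r2 / (r1 + r2), r4 / (r3 + r4))


# Illustrative values in ohms (assumed).
r4 = unknown_resistance(10.0, 20.0, 15.0)
print(f"R4 = {r4:.1f} ohm, balanced: {is_balanced(10.0, 20.0, 15.0, r4)}")
```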

15. The potentiometer is a device to compare potential differences. 
Since the method involves a condition of no current flow, the device 
can be used to measure potential difference, to measure the internal 
resistance of a cell, and to compare the emfs of two sources. 


| Physical Quantity | Symbol | Dimensions | Unit | Remark |
|---|---|---|---|---|
| Electric current | I | $[\mathrm{A}]$ | A | SI base unit |
| Charge | Q, q | $[\mathrm{T\,A}]$ | C | |
| Voltage, Electric potential difference | V | $[\mathrm{M\,L^2\,T^{-3}\,A^{-1}}]$ | V | Work/charge |
| Electromotive force | $\varepsilon$ | $[\mathrm{M\,L^2\,T^{-3}\,A^{-1}}]$ | V | Work/charge |
| Resistance | R | $[\mathrm{M\,L^2\,T^{-3}\,A^{-2}}]$ | Ω | R = V/I |
| Resistivity | ρ | $[\mathrm{M\,L^3\,T^{-3}\,A^{-2}}]$ | Ω m | R = ρl/A |
| Electrical conductivity | σ | $[\mathrm{M^{-1}\,L^{-3}\,T^{3}\,A^{2}}]$ | S m$^{-1}$ | σ = 1/ρ |
| Electric field | E | $[\mathrm{M\,L\,T^{-3}\,A^{-1}}]$ | V m$^{-1}$ | Electric force/charge |
| Drift speed | $v_d$ | $[\mathrm{L\,T^{-1}}]$ | m s$^{-1}$ | $v_d = eE\tau/m$ |
| Relaxation time | $\tau$ | $[\mathrm{T}]$ | s | |
| Current density | j | $[\mathrm{L^{-2}\,A}]$ | A m$^{-2}$ | current/area |
| Mobility | μ | $[\mathrm{M^{-1}\,T^{2}\,A}]$ | m$^2$ V$^{-1}$ s$^{-1}$ | |



Points to Ponder 

1. Current is a scalar although we represent current with an arrow. 
Currents do not obey the law of vector addition. That current is a 
scalar also follows from its definition. The current I through an area of 
cross-section is given by the scalar product of two vectors: 

$I = \mathbf{j} \cdot \Delta\mathbf{S}$ 

where j and ΔS are vectors. 

2. Refer to V-I curves of a resistor and a diode as drawn in the text. A 
resistor obeys Ohm’s law while a diode does not. The assertion that 
V = IR is a statement of Ohm’s law is not true. This equation defines 
resistance and it may be applied to all conducting devices whether 
they obey Ohm’s law or not. Ohm’s law asserts that the plot 
of I versus V is linear, i.e., R is independent of V. 

Equation $\mathbf{E} = \rho\,\mathbf{j}$ leads to another statement of Ohm’s law, i.e., a 
conducting material obeys Ohm’s law when the resistivity of the 
material does not depend on the magnitude and direction of applied 
electric field. 

3. Homogeneous conductors like silver or semiconductors like pure 
germanium or germanium containing impurities obey Ohm’s law 
within some range of electric field values. If the field becomes too 
strong, there are departures from Ohm’s law in all cases. 

4. Motion of conduction electrons in electric field E is the sum of (i) 
motion due to random collisions and (ii) that due to E. The motion 
due to random collisions averages to zero and does not contribute 
to $v_d$ (Chapter 11, Textbook of Class XI). Thus $v_d$ is only due to the 
applied electric field on the electron. 

5. The relation $\mathbf{j} = \rho\,\mathbf{v}$ should be applied to each type of charge 
carrier separately. In a conducting wire, the total current and charge 
density arise from both positive and negative charges: 

$\mathbf{j} = \rho_+ \mathbf{v}_+ + \rho_- \mathbf{v}_-$ 

$\rho = \rho_+ + \rho_-$ 

Now in a neutral wire carrying electric current, 

$\rho_+ = -\rho_-$ 

Further, $\mathbf{v}_+ \approx 0$ which gives 

$\rho = 0$ 

$\mathbf{j} = \rho_- \mathbf{v}_-$ 

Thus, the relation $\mathbf{j} = \rho\,\mathbf{v}$ does not apply to the total current charge 
density. 

6. Kirchhoff’s junction rule is based on conservation of charge and the 
outgoing currents add up and are equal to incoming current at a 
junction. Bending or reorienting the wire does not change the validity 
of Kirchhoff’s junction rule. 


Exercises 

3.1 The storage battery of a car has an emf of 12 V. If the internal 
resistance of the battery is 0.4 Ω, what is the maximum current that 
can be drawn from the battery? 

3.2 A battery of emf 10 V and internal resistance 3 Ω is connected to 
a resistor. If the current in the circuit is 0.5 A, what is the resistance of 
the resistor? What is the terminal voltage of the battery when the 
circuit is closed? 

3.3 (a) Three resistors 1 Ω, 2 Ω, and 3 Ω are combined in series. 
What is the total resistance of the combination? 

(b) If the combination is connected to a battery of emf 12 V and 
negligible internal resistance, obtain the potential drop across each 
resistor. 

3.4 (a) Three resistors 2 Ω, 4 Ω and 5 Ω are combined in parallel. 
What is the total resistance of the combination? 

(b) If the combination is connected to a battery of emf 20 V and 
negligible internal resistance, determine the current through each 
resistor, and the total current drawn from the battery. 

3.5 At room temperature (27.0 °C) the resistance of a heating 
element is 100 Ω. What is the temperature of the element if the 
resistance is found to be 117 Ω, given that the temperature coefficient 
of the material of the resistor is $1.70 \times 10^{-4}$ °C$^{-1}$? 

3.6 A negligibly small current is passed through a wire of length 15 m 
and uniform cross-section $6.0 \times 10^{-7}$ m$^2$, and its resistance is 
measured to be 5.0 Ω. What is the resistivity of the material at the 
temperature of the experiment? 

3.7 A silver wire has a resistance of 2.1 Ω at 27.5 °C, and a 
resistance of 2.7 Ω at 100 °C. Determine the temperature coefficient 
of resistivity of silver. 

3.8 A heating element using nichrome connected to a 230 V supply 
draws an initial current of 3.2 A which settles after a few seconds to a 
steady value of 2.8 A. What is the steady temperature of the heating 
element if the room temperature is 27.0 °C? Temperature coefficient 
of resistance of nichrome averaged over the temperature range 
involved is $1.70 \times 10^{-4}$ °C$^{-1}$. 

3.9 Determine the current in each branch of the network shown in 
Fig. 3.30: 




Figure 3.30 


3.10 (a) In a meter bridge [Fig. 3.27], the balance point is found to be 
at 39.5 cm from the end A, when the resistor Y is of 12.5 Ω. Determine 
the resistance of X. Why are the connections between resistors in a 
Wheatstone or meter bridge made of thick copper strips? 

(b) Determine the balance point of the bridge above if X and Y are 
interchanged. 

(c) What happens if the galvanometer and cell are interchanged at 
the balance point of the bridge? Would the galvanometer show any 
current? 


3.11 A storage battery of emf 8.0 V and internal resistance 0.5 Ω is 
being charged by a 120 V dc supply using a series resistor of 15.5 Ω. 
What is the terminal voltage of the battery during charging? What is 
the purpose of having a series resistor in the charging circuit? 












3.12 In a potentiometer arrangement, a cell of emf 1.25 V gives a 
balance point at 35.0 cm length of the wire. If the cell is replaced by 
another cell and the balance point shifts to 63.0 cm, what is the emf 
of the second cell? 

3.13 The number density of free electrons in a copper conductor 
estimated in Example 3.1 is $8.5 \times 10^{28}$ m$^{-3}$. How long does an 
electron take to drift from one end of a wire 3.0 m long to its other 
end? The area of cross-section of the wire is $2.0 \times 10^{-6}$ m$^2$ and it is 
carrying a current of 3.0 A. 

Additional Exercises 

3.14 The earth’s surface has a negative surface charge density of 
$10^{-9}$ C m$^{-2}$. The potential difference of 400 kV between the top of the 
atmosphere and the surface results (due to the low conductivity of the 
lower atmosphere) in a current of only 1800 A over the entire globe. If 
there were no mechanism of sustaining atmospheric electric field, 
how much time (roughly) would be required to neutralise the earth’s 
surface? (This never happens in practice because there is a 
mechanism to replenish electric charges, namely the continual 
thunderstorms and lightning in different parts of the globe). (Radius of 
earth = $6.37 \times 10^{6}$ m.) 

3.15 (a) Six lead-acid type of secondary cells each of emf 2.0 V and 
internal resistance 0.015 Ω are joined in series to provide a supply to 
a resistance of 8.5 Ω. What are the current drawn from the supply 
and its terminal voltage? 

(b) A secondary cell after long use has an emf of 1.9 V and a large 
internal resistance of 380 Ω. What maximum current can be drawn 
from the cell? Could the cell drive the starting motor of a car? 

3.16 Two wires of equal length, one of aluminium and the other of 
copper have the same resistance. Which of the two wires is lighter? 
Hence explain why aluminium wires are preferred for overhead power 
cables. ($\rho_{Al} = 2.63 \times 10^{-8}$ Ω m, $\rho_{Cu} = 1.72 \times 10^{-8}$ Ω m, Relative 
density of Al = 2.7, of Cu = 8.9.) 

3.17 What conclusion can you draw from the following observations 
on a resistor made of alloy manganin? 



| Current (A) | Voltage (V) | Current (A) | Voltage (V) |
|---|---|---|---|
| 0.2 | 3.94 | 3.0 | 59.2 |
| 0.4 | 7.87 | 4.0 | 78.8 |
| 0.6 | 11.8 | 5.0 | 98.6 |
| 0.8 | 15.7 | 6.0 | 118.5 |
| 1.0 | 19.7 | 7.0 | 138.2 |
| 2.0 | 39.4 | 8.0 | 158.0 |


3.18 Answer the following questions: 

(a) A steady current flows in a metallic conductor of non-uniform 
cross-section. Which of these quantities is constant along the 
conductor: current, current density, electric field, drift speed? 

(b) Is Ohm’s law universally applicable for all conducting elements? If 
not, give examples of elements which do not obey Ohm’s law. 

(c) A low voltage supply from which one needs high currents must 
have very low internal resistance. Why? 

(d) A high tension (HT) supply of, say, 6 kV must have a very large 
internal resistance. Why? 

3.19 Choose the correct alternative: 

(a) Alloys of metals usually have (greater/less) resistivity than that of 
their constituent metals. 

(b) Alloys usually have much (lower/higher) temperature coefficients 
of resistance than pure metals. 

(c) The resistivity of the alloy manganin is nearly independent 
of/increases rapidly with increase of temperature. 

(d) The resistivity of a typical insulator (e.g., amber) is greater than 
that of a metal by a factor of the order of ($10^{22}$/$10^{23}$). 

3.20 (a) Given n resistors each of resistance R, how will you combine 
them to get the (i) maximum (ii) minimum effective resistance? What 
is the ratio of the maximum to minimum resistance? 

(b) Given the resistances of 1 Ω, 2 Ω, 3 Ω, how will we combine them 
to get an equivalent resistance of (i) (11/3) Ω, (ii) (11/5) Ω, (iii) 6 Ω, (iv) 
(6/11) Ω? 

(c) Determine the equivalent resistance of networks shown in 
Fig. 3.31. 

Figure 3.31 



3.21 Determine the current drawn from a 12 V supply with internal 
resistance 0.5 Ω by the infinite network shown in Fig. 3.32. Each 
resistor has 1 Ω resistance. 

Figure 3.32 


3.22 Figure 3.33 shows a potentiometer with a cell of 2.0 V and 
internal resistance 0.40 Ω maintaining a potential drop across the 
resistor wire AB. A standard cell which maintains a constant emf of 
1.02 V (for very moderate currents up to a few mA) gives a balance 
point at 67.3 cm length of the wire. To ensure very low currents drawn 
from the standard cell, a very high resistance of 600 kΩ is put in 
series with it, which is shorted close to the balance point. The 
standard cell is then replaced by a cell of unknown emf $\varepsilon$ and the 
balance point found similarly, turns out to be at 82.3 cm length of the 
wire. 

Figure 3.33 

(a) What is the value of $\varepsilon$? 

(b) What purpose does the high resistance of 600 kΩ have? 

(c) Is the balance point affected by this high resistance? 

(d) Would the method work in the above situation if the driver cell of 
the potentiometer had an emf of 1.0 V instead of 2.0 V? 

(e) Would the circuit work well for determining an extremely small 
emf, say of the order of a few mV (such as the typical emf of a 
thermo-couple)? If not, how will you modify the circuit? 

3.23 Figure 3.34 shows a 2.0 V potentiometer used for the 
determination of internal resistance of a 1.5 V cell. The balance point 
of the cell in open circuit is 76.3 cm. When a resistor of 9.5 Ω is used 
in the external circuit of the cell, the balance point shifts to 64.8 cm 
length of the potentiometer wire. Determine the internal resistance of 
the cell. 

Chapter Four 


Moving Charges and Magnetism 


4.1 Introduction 

Both Electricity and Magnetism have been known for more than 2000 
years. However, it was only about 200 years ago, in 1820, that it was 
realised that they were intimately related*. During a lecture demonstration 
in the summer of 1820, Danish physicist Hans Christian Oersted noticed 
that a current in a straight wire caused a noticeable deflection in a nearby 
magnetic compass needle. He investigated this phenomenon. He found 
that the alignment of the needle is tangential to an imaginary circle which 
has the straight wire as its centre and has its plane perpendicular to the 
wire. This situation is depicted in Fig.4.1(a). It is noticeable when the 
current is large and the needle sufficiently close to the wire so that the 
earth’s magnetic field may be ignored. Reversing the direction of the 
current reverses the orientation of the needle [Fig. 4.1(b)]. The deflection 
increases on increasing the current or bringing the needle closer to the 
wire. Iron filings sprinkled around the wire arrange themselves in 
concentric circles with the wire as the centre [Fig. 4.1(c)]. Oersted 
concluded that moving charges or currents produced a magnetic field in 
the surrounding space. 

Following this, there was intense experimentation. In 1864, the laws 
obeyed by electricity and magnetism were unified and formulated by 
James Maxwell who then realised that light was electromagnetic waves. 
Radio waves were discovered by Hertz, and produced by J.C. Bose and 
G. Marconi by the end of the 19th century. Remarkable scientific and 
technological progress took place in the 20th century. This was due to our 
increased understanding of electromagnetism and the invention of 
devices for production, amplification, transmission and detection of 
electromagnetic waves. 

Figure 4.1 The magnetic field due to a straight long current-carrying wire. The wire is 
perpendicular to the plane of the paper. A ring of compass needles surrounds the wire. The 
orientation of the needles is shown when (a) the current emerges out of the plane of the paper, 
(b) the current moves into the plane of the paper. (c) The arrangement of iron filings around the 
wire. The darkened ends of the needle represent north poles. The effect of the earth’s magnetic 
field is neglected. 


In this chapter, we will see how magnetic field exerts forces on moving 
charged particles, like electrons, protons, and current-carrying wires. We 
shall also learn how currents produce magnetic fields. We shall see how 
particles can be accelerated to very high energies in a cyclotron. We 
shall study how currents and voltages are detected by a galvanometer. 




Hans Christian Oersted (1777-1851) Danish physicist and chemist, 
professor at Copenhagen. He observed that a compass needle 
suffers a deflection when placed near a wire carrying an electric 
current. This discovery gave the first empirical evidence of a 
connection between electric and magnetic phenomena. 


In this and the subsequent chapter on magnetism, we adopt the following 
convention: A current or a field (electric or magnetic) emerging out of 
the plane of the paper is depicted by a dot (⊙). A current or a field 
going into the plane of the paper is depicted by a cross (⊗)*. 
Figures 4.1(a) and 4.1(b) correspond to these two situations, respectively. 

* A dot appears like the tip of an arrow pointed at you, a cross is like the 
feathered tail of an arrow moving away from you. 

4.2 Magnetic Force 

4.2.1 Sources and fields 

Before we introduce the concept of a magnetic field B, we shall 
recapitulate what we have learnt in Chapter 1 about the electric field E. 
We have seen that the interaction between two charges can be 
considered in two stages. The charge Q, the source of the field, produces 
an electric field E, where 

$\mathbf{E} = \dfrac{Q\,\hat{\mathbf{r}}}{4\pi\varepsilon_0 r^2}$ (4.1) 



where $\hat{\mathbf{r}}$ is the unit vector along r, and the field E is a vector field. A charge q 
interacts with this field and experiences a force F given by 

$\mathbf{F} = q\mathbf{E} = \dfrac{q Q\,\hat{\mathbf{r}}}{4\pi\varepsilon_0 r^2}$ (4.2) 

Hendrik Antoon Lorentz (1853 - 1928) Dutch theoretical physicist, 
professor at Leiden. He investigated the relationship between 
electricity, magnetism, and mechanics. In order to explain the 
observed effect of magnetic fields on emitters of light (Zeeman 
effect), he postulated the existence of electric charges in the atom, for 
which he was awarded the Nobel Prize in 1902. He derived a set of 
transformation equations (known after him, as Lorentz transformation 
equations) by some tangled mathematical arguments, but he was not 
aware that these equations hinge on a new concept of space and 
time. 

As pointed out in Chapter 1, the field E is not just an artefact but has 
a physical role. It can convey energy and momentum and is not 
established instantaneously but takes finite time to propagate. The 
concept of a field was specially stressed by Faraday and was 
incorporated by Maxwell in his unification of electricity and magnetism. In 
addition to depending on each point in space, it can also vary with time, 
i.e., be a function of time. In our discussions in this chapter, we will 
assume that the fields do not change with time. 



The field at a particular point can be due to one or more charges. If there 
are more charges the fields add vectorially. You have already learnt in 
Chapter 1 that this is called the principle of superposition. Once the field 
is known, the force on a test charge is given by Eq. (4.2). 

Just as static charges produce an electric field, the currents or moving 
charges produce (in addition) a magnetic field, denoted by B(r), again a 
vector field. It has several basic properties identical to the electric field. It 
is defined at each point in space (and can in addition depend on time). 
Experimentally, it is found to obey the principle of superposition: the 
magnetic field of several sources is the vector addition of magnetic field 
of each individual source. 

4.2.2 Magnetic Field, Lorentz Force 

Let us suppose that there is a point charge q (moving with a velocity v 
and located at r at a given time t) in the presence of both the electric field 
E(r) and the magnetic field B(r). The force on an electric charge q due to 
both of them can be written as 

$\mathbf{F} = q[\mathbf{E}(\mathbf{r}) + \mathbf{v} \times \mathbf{B}(\mathbf{r})] \equiv \mathbf{F}_{electric} + \mathbf{F}_{magnetic}$ (4.3) 

This force was given first by H.A. Lorentz based on the extensive 
experiments of Ampere and others. It is called the Lorentz force. You 
have already studied in detail the force due to the electric field. If we 
look at the interaction with the magnetic field, we find the following 
features. 

(i) It depends on q, v and B (charge of the particle, the velocity and the 
magnetic field). Force on a negative charge is opposite to that on a 
positive charge. 

(ii) The magnetic force $q[\mathbf{v} \times \mathbf{B}]$ includes a vector product of velocity 
and magnetic field. The vector product makes the force due to magnetic 
field vanish (become zero) if velocity and magnetic field are parallel or 
anti-parallel. The force acts in a (sideways) direction perpendicular to 
both the velocity and the magnetic field. Its direction is given by the screw 
rule or right hand rule for vector (or cross) product as illustrated in Fig. 
4.2. 

(iii) The magnetic force is zero if charge is not moving (as then |v|= 0). 
Only a moving charge feels the magnetic force. 
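The three features listed above can be checked numerically with a short sketch of the Lorentz force $\mathbf{F} = q[\mathbf{E} + \mathbf{v} \times \mathbf{B}]$. The helper names `cross` and `lorentz_force` and the field and velocity values are assumptions made for illustration.

```python
def cross(a, b):
    """Vector (cross) product of two 3-component tuples."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def lorentz_force(q, E, v, B):
    """Return F = q (E + v x B) as a 3-component tuple (SI units)."""
    v_cross_b = cross(v, B)
    return tuple(q * (E[i] + v_cross_b[i]) for i in range(3))


E_CHARGE = 1.6e-19              # C
no_field = (0.0, 0.0, 0.0)      # no electric field in this illustration
v_x = (1.0e6, 0.0, 0.0)         # velocity along +x (assumed value, m/s)
b_y = (0.0, 0.5, 0.0)           # magnetic field along +y (assumed value, T)

f_positive = lorentz_force(+E_CHARGE, no_field, v_x, b_y)             # along +z
f_negative = lorentz_force(-E_CHARGE, no_field, v_x, b_y)             # along -z
f_parallel = lorentz_force(+E_CHARGE, no_field, v_x, (0.5, 0.0, 0.0)) # v parallel to B
f_at_rest = lorentz_force(+E_CHARGE, no_field, no_field, b_y)         # charge not moving

print(f_positive, f_negative, f_parallel, f_at_rest)
```

The force on the negative charge is opposite to that on the positive charge, and it vanishes both when v is parallel to B and when the charge is at rest.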

The expression for the magnetic force helps us to define the unit of the 
magnetic field, if one takes q, F and v, all to be unity in the force equation 
$\mathbf{F} = q[\mathbf{v} \times \mathbf{B}] = q v B \sin\theta\,\hat{\mathbf{n}}$, where θ is the angle between v and B [see 
Fig. 4.2 (a)]. The magnitude of magnetic field B is 1 SI unit, when the 
force acting on a unit charge (1 C), moving perpendicular to B with a 
speed 1 m/s, is one newton. 



Figure 4.2 The direction of the magnetic force acting on a charged particle. (a) The force on a 
positively charged particle with velocity v and making an angle θ with the magnetic field B is given 
by the right-hand rule. (b) A moving charged particle q is deflected in an opposite sense to -q in 
the presence of magnetic field. 


Dimensionally, we have [B] = [F/qv] and the unit of B is newton second 
/ (coulomb metre). This unit is called tesla (T) named after Nikola Tesla 
(1856 - 1943). Tesla is a rather large unit. A smaller unit (non-SI) called 
gauss ($= 10^{-4}$ tesla) is also often used. The earth’s magnetic field is about 
$3.6 \times 10^{-5}$ T. Table 4.1 lists magnetic fields over a wide range in the 
universe. 


Table 4.1 Order of magnitudes of magnetic fields in a variety of physical situations 

| Physical situation | Magnitude of B (in tesla) |
|---|---|
| Surface of a neutron star | $10^{8}$ |
| Typical large field in a laboratory | 1 |
| Near a small bar magnet | $10^{-2}$ |
| On the earth’s surface | $10^{-5}$ |
| Human nerve fibre | $10^{-10}$ |
| Interstellar space | $10^{-12}$ |


4.2.3 Magnetic force on a current-carrying conductor 

We can extend the analysis for force due to magnetic field on a single 
moving charge to a straight rod carrying current. Consider a rod of a 
uniform cross-sectional area A and length l. We shall assume one kind of 
mobile carriers as in a conductor (here electrons). Let the number density 
of these mobile charge carriers in it be n. Then the total number of mobile 
charge carriers in it is nlA. For a steady current I in this conducting rod, 
we may assume that each mobile carrier has an average drift velocity $\mathbf{v}_d$ 
(see Chapter 3). In the presence of an external magnetic field B, the 
force on these carriers is: 

$\mathbf{F} = (n l A)\, q\, \mathbf{v}_d \times \mathbf{B}$ 

where q is the value of the charge on a carrier. Now $n q \mathbf{v}_d$ is the current 
density j and $|n q \mathbf{v}_d| A$ is the current I (see Chapter 3 for the discussion 
of current and current density). Thus, 

$\mathbf{F} = [(n q \mathbf{v}_d)\, l A] \times \mathbf{B} = [\mathbf{j} A l] \times \mathbf{B} = I\,\mathbf{l} \times \mathbf{B}$ (4.4) 

where l is a vector of magnitude l, the length of the rod, and with a 
direction identical to the current I. Note that the current I is not a vector. In 
the last step leading to Eq. (4.4), we have transferred the vector sign 
from j to l. 

Equation (4.4) holds for a straight rod. In this equation, B is the external 
magnetic field. It is not the field produced by the current-carrying rod. If 
the wire has an arbitrary shape we can calculate the Lorentz force on it 
by considering it as a collection of linear strips $d\mathbf{l}_j$ and summing 

$\mathbf{F} = \sum_j I\, d\mathbf{l}_j \times \mathbf{B}$ 

This summation can be converted to an integral in most cases. 
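The summation over short straight strips can be sketched numerically. The example below assumes a semicircular wire in a uniform external field perpendicular to its plane; the function names, the radius, the current and the field strength are all illustrative assumptions. For a uniform field, the sum over strips equals the force on a straight wire joining the same end points, which the sketch confirms.

```python
import math


def cross(a, b):
    """Vector (cross) product of two 3-component tuples."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def force_on_wire(points, current, B):
    """Sum I dl x B over the straight strips joining consecutive points.

    B is taken to be uniform over the wire (an assumption of this sketch).
    """
    total = [0.0, 0.0, 0.0]
    for start, end in zip(points, points[1:]):
        dl = tuple(end[i] - start[i] for i in range(3))
        dF = cross(dl, B)
        for i in range(3):
            total[i] += current * dF[i]
    return tuple(total)


radius = 0.10            # m (assumed)
current = 2.0            # A (assumed)
B = (0.0, 0.0, 0.5)      # T, uniform, along +z (assumed)
n_strips = 200

# Semicircle in the x-y plane from (radius, 0) to (-radius, 0).
semicircle = [(radius * math.cos(math.pi * k / n_strips),
               radius * math.sin(math.pi * k / n_strips),
               0.0) for k in range(n_strips + 1)]

F_curved = force_on_wire(semicircle, current, B)
F_straight = force_on_wire([semicircle[0], semicircle[-1]], current, B)
print(F_curved, F_straight)
```

Both results point along +y with magnitude 2IRB, here 0.2 N.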


On permittivity and permeability 

In the universal law of gravitation, we say that any two point masses 
exert a force on each other which is proportional to the product of the 
masses $m_1$, $m_2$ and inversely proportional to the square of the 
distance r between them. We write it as $F = G m_1 m_2 / r^2$ where G is the 
universal constant of gravitation. Similarly, in Coulomb’s law 
of electrostatics we write the force between two point charges $q_1$, $q_2$, 
separated by a distance r as 
$F = k q_1 q_2 / r^2$ where k is a constant of proportionality. In SI units, k is 
taken as $1/4\pi\varepsilon$ where ε is the permittivity of the medium. Also in 
magnetism, we get another constant, which in SI units, is taken as $\mu/4\pi$ 
where μ is the permeability of the medium. 

Although G, ε and μ arise as proportionality constants, there is a 
difference between gravitational force and electromagnetic force. 
While the gravitational force does not depend on the intervening 
medium, the electromagnetic force depends on the medium between 
the two charges or magnets. Hence, while G is a universal 
constant, ε and μ depend on the medium. They have different values 
for different media. The product εμ turns out to be related to the 
speed v of electromagnetic radiation in the medium through $\varepsilon\mu = 1/v^2$. 

Electric permittivity ε is a physical quantity that describes how an 
electric field affects and is affected by a medium. It is determined by 
the ability of a material to polarise in response to an applied field, and 
thereby to cancel, partially, the field inside the material. Similarly, 
magnetic permeability μ is the ability of a substance to acquire 
magnetisation in magnetic fields. It is a measure of the extent to 
which magnetic field can penetrate matter. 
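As a quick check of the relation between permittivity, permeability and the speed of electromagnetic radiation, the sketch below evaluates $v = 1/\sqrt{\varepsilon\mu}$ for vacuum. The vacuum values of ε and μ used here are standard reference values that are not quoted in this box, so they are an assumption of the example.

```python
import math

EPSILON_0 = 8.854e-12         # F/m, permittivity of vacuum (assumed reference value)
MU_0 = 4 * math.pi * 1e-7     # T m/A, permeability of vacuum (conventional value)

speed_in_vacuum = 1 / math.sqrt(EPSILON_0 * MU_0)
print(f"Speed of electromagnetic radiation in vacuum: {speed_in_vacuum:.3e} m/s")
```

The result is close to $3 \times 10^8$ m/s, the speed of light in vacuum.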


Example 4.1 A straight wire of mass 200 g and length 1.5 m carries a 
current of 2 A. It is suspended in mid-air by a uniform horizontal 
magnetic field B (Fig. 4.3). What is the magnitude of the magnetic 
field? 

Solution From Eq. (4.4), we find that there is an upward force F, of 
magnitude IlB. For mid-air suspension, this must be balanced by the 
force due to gravity: 

$m g = I l B$ 

$B = \dfrac{m g}{I l} = \dfrac{0.2 \times 9.8}{2 \times 1.5} = 0.65$ T 

Note that it would have been sufficient to specify m/l, the mass per 
unit length of the wire. The earth’s magnetic field is approximately 
$4 \times 10^{-5}$ T and we have ignored it. 
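The balance condition m g = I l B can be evaluated in a few lines, using the data of Example 4.1 and g = 9.8 m/s². The function name `levitation_field` is hypothetical. The second calculation uses only the mass per unit length, which illustrates the closing remark of the example.

```python
G_ACCEL = 9.8   # m/s^2, acceleration due to gravity


def levitation_field(mass, length, current):
    """Field needed so that the magnetic force I l B balances the weight m g."""
    return mass * G_ACCEL / (current * length)


# Data of Example 4.1: mass 200 g, length 1.5 m, current 2 A.
B_needed = levitation_field(mass=0.200, length=1.5, current=2.0)

# Same result from the mass per unit length alone.
mass_per_length = 0.200 / 1.5
B_from_density = mass_per_length * G_ACCEL / 2.0

print(f"B = {B_needed:.2f} T")
```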


PHYSICS 

Charged particles moving in a magnetic field. 

Interactive demonstration: 

http://www.phys.hawaii.edu/~teb/optics/java/partmagn/index.html 











Example 4.2 If the magnetic field is parallel to the positive y-axis and 
the charged particle is moving along the positive x-axis (Fig. 4.4), 
which way would the Lorentz force be for (a) an electron (negative 
charge), (b) a proton (positive charge). 

Figure 4.4 

Solution The velocity v of particle is along the x-axis, while B, the 
magnetic field is along the y-axis, so $\mathbf{v} \times \mathbf{B}$ is along the z-axis (screw 
rule or right-hand thumb rule). So, (a) for electron it will be along -z 
axis, (b) for a positive charge (proton) the force is along +z axis. 


4.3 Motion in a Magnetic Field 

We will now consider, in greater detail, the motion of a charge moving in 
a magnetic field. We have learnt in Mechanics (see Class XI book, 
Chapter 6) that a force on a particle does work if the force has a 
component along (or opposed to) the direction of motion of the particle. In 
the case of motion of a charge in a magnetic field, the magnetic force is 
perpendicular to the velocity of the particle. So no work is done and no 
change in the magnitude of the velocity is produced (though the direction 
of momentum may be changed). [Notice that this is unlike the force due 
to an electric field, qE, which can have a component parallel (or 
antiparallel) to motion and thus can transfer energy in addition to 
momentum.] 

We shall consider motion of a charged particle in a uniform magnetic 
field. First consider the case of v perpendicular to B. The perpendicular 
force, $q\,\mathbf{v} \times \mathbf{B}$, acts as a centripetal force and produces a circular motion 
perpendicular to the magnetic field. The particle will describe a circle if v 
and B are perpendicular to each other (Fig. 4.5). 

Figure 4.5 Circular motion 

If velocity has a component along B, this component remains unchanged 
as the motion along the magnetic field will not be affected by the 
magnetic field. The motion in a plane perpendicular to B is as before a 
circular one, thereby producing a helical motion (Fig. 4.6). 

Figure 4.6 Helical motion 

You have already learnt in earlier classes (See Class XI, Chapter 4) that 
if r is the radius of the circular path of a particle, then a force of $m v^2 / r$ 
acts perpendicular to the path towards the centre of the circle, and is 
called the centripetal force. If the velocity v is perpendicular to the 
magnetic field B, the magnetic force is perpendicular to both v and B and 
acts like a centripetal force. It has a magnitude q v B. Equating the two 
expressions for centripetal force, 

$m v^2 / r = q v B$, which gives 

$r = m v / qB$ (4.5) 

for the radius of the circle described by the charged particle. The larger 
the momentum, the larger is the radius and bigger the circle described. If 
ω is the angular frequency, then v = ω r. So, 

$\omega = 2\pi\nu = qB/m$ [4.6(a)] 

which is independent of the velocity or energy. Here ν is the frequency of 
rotation. The independence of ν from energy has important application in 
the design of a cyclotron (see Section 4.4.2). 




The time taken for one revolution is $T = 2\pi/\omega \equiv 1/\nu$. If there is a component 
of the velocity parallel to the magnetic field (denoted by $v_{\parallel}$), it will make 
the particle move along the field and the path of the particle would be a 
helical one (Fig. 4.6). The distance moved along the magnetic field in one 
rotation is called pitch p. Using Eq. [4.6 (a)], we have 

$p = v_{\parallel} T = 2\pi m v_{\parallel} / qB$ [4.6(b)] 

The radius of the circular component of motion is called the radius of the 
helix. 
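The relations for the radius, angular frequency, period and pitch can be collected in one small function. This is a sketch under stated assumptions: the function name `helix_parameters` is hypothetical, and the proton speeds and field strength are illustrative values, not data from the text.

```python
import math


def helix_parameters(mass, charge, B, v_perp, v_par):
    """Return (radius, omega, period, pitch) for a charge in a uniform field B.

    v_perp and v_par are the speed components normal and parallel to B.
    """
    radius = mass * v_perp / (charge * B)   # Eq. (4.5) with v -> v_perp
    omega = charge * B / mass               # Eq. [4.6(a)]
    period = 2 * math.pi / omega
    pitch = v_par * period                  # Eq. [4.6(b)]
    return radius, omega, period, pitch


# Illustrative proton (assumed values).
M_PROTON = 1.67e-27   # kg
Q_PROTON = 1.6e-19    # C
radius, omega, period, pitch = helix_parameters(
    M_PROTON, Q_PROTON, B=0.1, v_perp=1.0e5, v_par=2.0e5)

# A faster particle in the same field: larger circle, same period.
radius_fast, _, period_fast, _ = helix_parameters(
    M_PROTON, Q_PROTON, B=0.1, v_perp=5.0e5, v_par=2.0e5)

print(f"r = {radius:.4f} m, T = {period:.3e} s, pitch = {pitch:.4f} m")
```

The period is the same for both speeds, which is the property that the cyclotron relies on.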


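Example 4.3 What is the radius of the path of an electron (mass 
$9 \times 10^{-31}$ kg and charge $1.6 \times 10^{-19}$ C) moving at a speed of $3 \times 10^{7}$ 
m/s in a magnetic field of $6 \times 10^{-4}$ T perpendicular to it? What is its 
frequency? Calculate its energy in keV. (1 eV = $1.6 \times 10^{-19}$ J). 

Solution Using Eq. (4.5) we find 

$r = m v / (qB) = (9 \times 10^{-31}\ \text{kg} \times 3 \times 10^{7}\ \text{m s}^{-1}) / (1.6 \times 10^{-19}\ \text{C} \times 6 \times 10^{-4}\ \text{T})$ 

$= 28 \times 10^{-2}$ m = 28 cm 

$\nu = v / (2\pi r) = 17 \times 10^{6}\ \text{s}^{-1} = 17 \times 10^{6}$ Hz = 17 MHz. 

$E = (1/2) m v^2 = (1/2) \times 9 \times 10^{-31}\ \text{kg} \times 9 \times 10^{14}\ \text{m}^2/\text{s}^2 = 40.5 \times 10^{-17}$ J 
$\approx 4 \times 10^{-16}$ J = 2.5 keV. 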
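The arithmetic of Example 4.3 can be reproduced directly from Eq. (4.5) and the data given in the example. Evaluated this way, the radius comes out at about 0.28 m, the frequency at about 17 MHz and the energy at about 2.5 keV. The variable names are illustrative.

```python
import math

# Data of Example 4.3.
mass = 9e-31          # kg
charge = 1.6e-19      # C
speed = 3e7           # m/s
B = 6e-4              # T
EV_IN_JOULE = 1.6e-19

radius = mass * speed / (charge * B)              # Eq. (4.5)
frequency = speed / (2 * math.pi * radius)        # revolutions per second
energy_joule = 0.5 * mass * speed ** 2
energy_kev = energy_joule / EV_IN_JOULE / 1e3

print(f"r = {radius:.3f} m, frequency = {frequency:.3e} Hz, E = {energy_kev:.2f} keV")
```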



Helical motion of charged particles and aurora borealis 

In polar regions like Alaska and Northern Canada, a splendid display 
of colours is seen in the sky. The appearance of dancing green and pink 
lights is fascinating, and equally puzzling. An explanation of this 
natural phenomenon is now found in physics, in terms of what we 
have studied here. 

Consider a charged particle of mass m and charge q, entering a 
region of magnetic field B with an initial velocity v. Let this velocity 
have a component $v_p$ parallel to the magnetic field and a 
component $v_n$ normal to it. There is no force on a charged particle in 
the direction of the field. Hence the particle continues to travel with 
the velocity $v_p$ parallel to the field. The normal component $v_n$ of the 
particle results in a Lorentz force ($\mathbf{v}_n \times \mathbf{B}$) which is perpendicular to 
both $v_n$ and B. As seen in Section 4.3.1 the particle thus has a 
tendency to perform a circular motion in a plane perpendicular to the 
magnetic field. When this is coupled with the velocity parallel to the 
field, the resulting trajectory will be a helix along the magnetic field 
line, as shown in Figure (a) here. Even if the field line bends, the 
helically moving particle is trapped and guided to move around the 
field line. Since the Lorentz force is normal to the velocity at each 
point, the field does no work on the particle and the magnitude of 
velocity remains the same. 





During a solar flare, a large number of electrons and protons are 
ejected from the sun. Some of them get trapped in the earth’s 
magnetic field and move in helical paths along the field lines. The 
field lines come closer to each other near the magnetic poles; 
see figure (b). Hence the density of charges increases near the 
poles. These particles collide with atoms and molecules of the 
atmosphere. Excited oxygen atoms emit green light and excited 
nitrogen atoms emit pink light. This phenomenon is called Aurora 
Borealis in physics. 


4.4 Motion in Combined Electric and Magnetic Fields 

4.4.1 Velocity selector 

You know that a charge q moving with velocity v in presence of both 
electric and magnetic fields experiences a force given by Eq. (4.3), that 
is, 









F = q (E + v × B) = F_E + F_B


We shall consider the simple case in which electric and magnetic fields 
are perpendicular to each other and also perpendicular to the velocity of 
the particle, as shown in Fig. 4.7. We have, 

E = E ĵ,  B = B k̂,  v = v î

F_E = qE = qE ĵ,  F_B = q v × B = q (v î × B k̂) = −qvB ĵ

Therefore, F = q (E − vB) ĵ.

Thus, electric and magnetic forces are in opposite directions as shown in 
the figure. Suppose, we adjust the value of E and B such that 
magnitudes of the two forces are equal. Then, total force on the charge is 
zero and the charge will move in the fields undeflected. This happens 
when, 

qE = qvB  or  v = E/B    (4.7)

Figure 4.7

This condition can be used to select charged particles of a particular
velocity out of a beam containing charges moving with different speeds
(irrespective of their charge and mass). The crossed E and B fields,
therefore, serve as a velocity selector. Only particles with speed E/B pass
undeflected through the region of crossed fields. This method was
employed by J. J. Thomson in 1897 to measure the charge to mass ratio
(e/m) of an electron. The principle is also employed in the mass
spectrometer, a device that separates charged particles, usually ions,
according to their charge to mass ratio.
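The selection condition v = E/B can be checked numerically. The following is a minimal sketch for the geometry used here (E along +y, B along +z, beam along +x). The function names and the field values E = 1.0 × 10⁴ V/m and B = 2.0 × 10⁻² T are assumptions chosen only for illustration.

```python
def net_force_y(q, v, E, B):
    """y-component of F = q(E + v x B) for v along x, E along y, B along z."""
    return q * (E - v * B)

def selected_speed(E, B):
    """Speed at which the electric and magnetic forces cancel."""
    return E / B

E = 1.0e4    # V/m, assumed
B = 2.0e-2   # T, assumed
q = 1.6e-19  # C, charge of a proton

v0 = selected_speed(E, B)
for v in (0.5 * v0, v0, 2.0 * v0):
    # Slower particles are pushed along +y, faster ones along -y.
    print(f"v = {v:.2e} m/s, F_y = {net_force_y(q, v, E, B):+.2e} N")
```

The charge q scales the force but not the selected speed, which is why the selector works irrespective of charge and mass.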



Cyclotron 

Interactive demonstration: 

http://www.phy.ntnu.edu.tw/ntnujava/index.php?topic=33.0 


4.4.2 Cyclotron 

The cyclotron is a machine to accelerate charged particles or ions to high 
energies. It was invented by E.O. Lawrence and M.S. Livingston in 1934 
to investigate nuclear structure. The cyclotron uses both electric and 
magnetic fields in combination to increase the energy of charged 
particles. As the fields are perpendicular to each other they are called 
crossed fields. The cyclotron uses the fact that the frequency of
revolution of the charged particle in a magnetic field is independent of
its energy. The particles move most of the time inside two semicircular
disc-like metal containers, D1 and D2, which are called dees as they look
like the letter D. Figure 4.8 shows a schematic view of the cyclotron.
Inside the metal boxes the particle is shielded and is not acted on by the
electric field. The magnetic field, however, acts on the particle and
makes it go round in a
circular path inside a dee. Every time the particle moves from one dee to
another it is acted upon by the electric field. The sign of the electric field 
is changed alternately in tune with the circular motion of the particle. This 
ensures that the particle is always accelerated by the electric field. Each 
time the acceleration increases the energy of the particle. As energy 
increases, the radius of the circular path increases. So the path is a 
spiral one. 

The whole assembly is evacuated to minimise collisions between the ions 
and the air molecules. A high frequency alternating voltage is applied to 
the dees. In the sketch shown in Fig. 4.8, positive ions or positively 
charged particles (e.g., protons) are released at the centre P. They move 
in a semi-circular path in one of the dees and arrive in the gap between 
the dees in a time interval T/2; where T, the period of revolution, is given 
by Eq. (4.6), 

T = 1/ν_c = 2πm/(qB)

or

ν_c = qB/(2πm)    (4.8)


This frequency is called the cyclotron frequency for obvious reasons and
is denoted by ν_c.

The frequency ν_a of the applied voltage is adjusted so that the polarity of
the dees is reversed in the same time that it takes the ions to complete
one half of the revolution. The requirement ν_a = ν_c is called the resonance
condition. The phase of the supply is adjusted so that when the positive
ions arrive at the edge of D1, D2 is at a lower potential and the ions are
accelerated across the gap. Inside the dees the particles travel in a
region free of the electric field. The increase in their kinetic energy is qV
each time they cross from one dee to another (V refers to the voltage
across the dees at that time). From Eq. (4.5), it is clear that the radius of
their path goes on increasing each time their kinetic energy increases.
The ions are repeatedly accelerated across the dees until they have the 
required energy to have a radius approximately that of the dees. They 
are then deflected by a magnetic field and leave the system via an exit 
slit. From Eq. (4.5) we have, 


v = qBR/m    (4.9)


where R is the radius of the trajectory at exit, and equals the radius of a 
dee. 

Figure 4.8 A schematic sketch of the cyclotron. There is a source of charged particles or ions at P
which move in a circular fashion in the dees, D1 and D2, on account of a uniform perpendicular
magnetic field B. An alternating voltage source accelerates these ions to high speeds. The ions
are eventually ‘extracted’ at the exit port.


Hence, the kinetic energy of the ions is, 


(1/2) m v² = q²B²R²/(2m)    (4.10)


The operation of the cyclotron is based on the fact that the time for one 
revolution of an ion is independent of its speed or radius of its orbit. The 
cyclotron is used to bombard nuclei with energetic particles, so 
accelerated by it, and study the resulting nuclear reactions. It is also used 
to implant ions into solids and modify their properties or even synthesise 
new materials. It is used in hospitals to produce radioactive substances 
which can be used in diagnosis and treatment. 


Example 4.4 A cyclotron’s oscillator frequency is 10 MHz. What
should be the operating magnetic field for accelerating protons? If the
radius of its ‘dees’ is 60 cm, what is the kinetic energy (in MeV) of the
proton beam produced by the accelerator?

(e = 1.60 × 10⁻¹⁹ C, m_p = 1.67 × 10⁻²⁷ kg, 1 MeV = 1.6 × 10⁻¹³ J).

Solution The oscillator frequency should be same as proton’s 
cyclotron frequency. 

Using Eqs. (4.5) and [4.6(a)] we have

B = 2π m ν/q = 6.3 × 1.67 × 10⁻²⁷ × 10⁷/(1.6 × 10⁻¹⁹) = 0.66 T

Final velocity of protons is

v = r × 2π ν = 0.6 m × 6.3 × 10⁷ s⁻¹ = 3.78 × 10⁷ m/s.

E = ½ mv² = 1.67 × 10⁻²⁷ × 14.3 × 10¹⁴/(2 × 1.6 × 10⁻¹³) ≈ 7.5 MeV.
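The arithmetic of Example 4.4 can be reproduced in a few lines. This is a sketch under the same non-relativistic assumptions as the example. The function names are illustrative, and using the full value of 2π rather than 6.3 shifts the last digit slightly, so the energy comes out at roughly 7 MeV.

```python
import math

def cyclotron_field(freq_hz, mass_kg, charge_c):
    """Field for which the cyclotron frequency qB/(2 pi m) equals freq_hz."""
    return 2.0 * math.pi * mass_kg * freq_hz / charge_c

def exit_speed(radius_m, freq_hz):
    """Speed on a circular orbit of the given radius: v = 2 pi nu R."""
    return 2.0 * math.pi * freq_hz * radius_m

def kinetic_energy_mev(mass_kg, speed):
    """Non-relativistic kinetic energy, converted with 1 MeV = 1.6e-13 J."""
    return 0.5 * mass_kg * speed**2 / 1.6e-13

m_p, e = 1.67e-27, 1.60e-19      # proton mass (kg) and charge (C)
B = cyclotron_field(10.0e6, m_p, e)
v = exit_speed(0.60, 10.0e6)
ke = kinetic_energy_mev(m_p, v)
print(f"B = {B:.2f} T, v = {v:.2e} m/s, KE = {ke:.1f} MeV")
```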




Accelerators in India 

India has been an early entrant in the area of accelerator-based
research. The vision of Dr. Meghnad Saha created a 37” Cyclotron in
the Saha Institute of Nuclear Physics in Kolkata in 1953. This was
soon followed by a series of Cockcroft-Walton type of accelerators
established in Tata Institute of Fundamental Research (TIFR),
Mumbai, Aligarh Muslim University (AMU), Aligarh, Bose Institute,
Kolkata and Andhra University, Waltair.

The sixties saw the commissioning of a number of Van de Graaff
accelerators: a 5.5 MV terminal machine in Bhabha Atomic Research
Centre (BARC), Mumbai (1963); a 2 MV terminal machine in Indian
Institute of Technology (IIT), Kanpur; a 400 kV terminal machine in
Banaras Hindu University (BHU), Varanasi; and Punjabi University,
Patiala. One 66 cm Cyclotron donated by the University of Rochester,
USA, was commissioned in Panjab University, Chandigarh. A small
electron accelerator was also established in University of Pune,
Pune.

In a major initiative taken in the seventies and eighties, a Variable 
Energy Cyclotron was built indigenously in Variable Energy Cyclotron 
Centre (VECC), Kolkata; 2 MV Tandem Van de Graaff accelerator 
was developed and built in BARC and a 14 MV Tandem Pelletron 
accelerator was installed in TIFR. 






This was soon followed by a 15 MV Tandem Pelletron established by 
University Grants Commission (UGC), as an inter-university facility in 
Inter-University Accelerator Centre (IUAC), New Delhi; a 3 MV 
Tandem Pelletron in Institute of Physics, Bhubaneswar; and two 1.7 
MV Tandetrons in Atomic Minerals Directorate for Exploration and 
Research, Hyderabad and Indira Gandhi Centre for Atomic 
Research, Kalpakkam. Both TIFR and IUAC are augmenting their 
facilities with the addition of superconducting LINAC modules to 
accelerate the ions to higher energies. 

Besides these ion accelerators, the Department of Atomic Energy 
(DAE) has developed many electron accelerators. A 2 GeV 
Synchrotron Radiation Source is being built in Raja Ramanna Centre 
for Advanced Technologies, Indore. 

The Department of Atomic Energy is considering Accelerator Driven 
Systems (ADS) for power production and fissile material breeding as 
future options. 


4.5 Magnetic Field due to a Current Element, Biot-Savart Law

All magnetic fields that we know are due to currents (or moving charges) 
and due to intrinsic magnetic moments of particles. Here, we shall study 
the relation between current and the magnetic field it produces. It is given 
by the Biot-Savart law. Figure 4.9 shows a finite conductor XY carrying
current I. Consider an infinitesimal element dl of the conductor. The
magnetic field dB due to this element is to be determined at a point P
which is at a distance r from it. Let θ be the angle between dl and the
displacement vector r. According to the Biot-Savart law, the magnitude of
the magnetic field dB is proportional to the current I, the element length
|dl|, and inversely proportional to the square of the distance r. Its
direction* is perpendicular to the plane containing dl and r. Thus, in
vector notation,


dB ∝ I dl × r / r³

dB = (μ0/4π) I dl × r / r³    [4.11(a)]

Figure 4.9 Illustration of the Biot-Savart law. The current element I dl produces a field dB at a
distance r. The ⊗ sign indicates that the field is perpendicular to the plane of this page and
directed into it.


where μ0/4π is a constant of proportionality. The above expression holds
when the medium is vacuum.


The magnitude of this field is, 


|dB| = (μ0/4π) I dl sin θ / r²    [4.11(b)]


where we have used the property of cross-product. Equation [4.11(a)]
constitutes our basic equation for the magnetic field. The proportionality
constant in SI units has the exact value,

μ0/4π = 10⁻⁷ T m/A    [4.11(c)]

We call μ0 the permeability of free space (or vacuum).

The Biot-Savart law for the magnetic field has certain similarities to, as
well as differences from, Coulomb’s law for the electrostatic field. Some of
these are:

(i) Both are long range, since both depend inversely on the square of 
distance from the source to the point of interest. The principle of 
superposition applies to both fields. [In this connection, note that the 
magnetic field is linear in the source I dl just as the electrostatic field is 
linear in its source: the electric charge.] 

(ii) The electrostatic field is produced by a scalar source, namely, the
electric charge. The magnetic field is produced by a vector source I dl.

(iii) The electrostatic field is along the displacement vector joining the
source and the field point. The magnetic field is perpendicular to the
plane containing the displacement vector r and the current element I dl.

(iv) There is an angle dependence in the Biot-Savart law which is not
present in the electrostatic case. In Fig. 4.9, the magnetic field at any
point in the direction of dl (the dashed line) is zero. Along this line,
θ = 0, sin θ = 0 and from Eq. [4.11(a)], |dB| = 0.

There is an interesting relation between ε0, the permittivity of free space;
μ0, the permeability of free space; and c, the speed of light in
vacuum:

ε0 μ0 = (4π ε0) (μ0/4π) = [1/(9 × 10⁹)] (10⁻⁷) = 1/(3 × 10⁸)² = 1/c²


* The sense of dl × r is also given by the Right Hand Screw rule: Look at
the plane containing vectors dl and r. Imagine moving from the first vector
towards the second vector. If the movement is anticlockwise, the resultant is
towards you. If it is clockwise, the resultant is away from you.

We will discuss this connection further in Chapter 8 on the
electromagnetic waves. Since the speed of light in vacuum is constant,
the product μ0ε0 is fixed in magnitude. Choosing the value of either ε0 or
μ0 fixes the value of the other. In SI units, μ0 is fixed to be equal to
4π × 10⁻⁷ in magnitude.


Example 4.5 An element Δl = Δx î is placed at the origin and carries
a large current I = 10 A (Fig. 4.10). What is the magnetic field on the
y-axis at a distance of 0.5 m? Δx = 1 cm.

Figure 4.10

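Solution

|dB| = (μ0/4π) I dl sin θ / r²    [using Eq. (4.11)]

dl = Δx = 10⁻² m, I = 10 A, r = 0.5 m = y, μ0/4π = 10⁻⁷ T m/A

θ = 90°; sin θ = 1

|dB| = 10⁻⁷ × 10 × 10⁻² / (25 × 10⁻²) = 4 × 10⁻⁸ T

The direction of the field is in the +z-direction. This is so since,

dl × r = Δx î × y ĵ = y Δx (î × ĵ) = y Δx k̂

We remind you of the following cyclic property of cross-products,

î × ĵ = k̂;  ĵ × k̂ = î;  k̂ × î = ĵ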

Note that the field is small in magnitude. 
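The Biot-Savart law for a single short element can be evaluated directly as a vector expression. The sketch below applies it to the element of Example 4.5. The helper names cross and biot_savart_element are illustrative choices, and the element is treated as a point source, which is reasonable only because Δx is much smaller than the distance to the field point.

```python
import math

MU0_OVER_4PI = 1.0e-7  # T m/A

def cross(a, b):
    """Cross product of two 3-component tuples."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def biot_savart_element(current, dl, r):
    """dB = (mu0/4pi) I dl x r / r^3 for a short element dl at the origin."""
    r_mag = math.sqrt(sum(c * c for c in r))
    k = MU0_OVER_4PI * current / r_mag**3
    return tuple(k * c for c in cross(dl, r))

# Example 4.5: element of 1 cm along x, 10 A, field point 0.5 m up the y-axis.
dB = biot_savart_element(10.0, (0.01, 0.0, 0.0), (0.0, 0.5, 0.0))
# A field point along the direction of the element gives zero field.
dB_on_line = biot_savart_element(10.0, (0.01, 0.0, 0.0), (0.5, 0.0, 0.0))
print(dB, dB_on_line)
```

Only the z-component is non-zero for the first point, in agreement with the direction found from î × ĵ = k̂.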


In the next section, we shall use the Biot-Savart law to calculate the 
magnetic field due to a circular loop. 

4.6 Magnetic Field on the Axis of a Circular Current Loop 

In this section, we shall evaluate the magnetic field due to a circular coil 
along its axis. The evaluation entails summing up the effect of 
infinitesimal current elements (I dl) mentioned in the previous section. We 
assume that the current I is steady and that the evaluation is carried out 
in free space (i.e., vacuum). 

Figure 4.11 depicts a circular loop carrying a steady current I. The loop is
placed in the y-z plane with its centre at the origin O and has a radius R.
The x-axis is the axis of the loop. We wish to calculate the magnetic field
at the point P on this axis. Let x be the distance of P from the centre O of
the loop.

Consider a conducting element dl of the loop. This is shown in Fig. 4.11. 
The magnitude dB of the magnetic field due to dl is given by the Biot- 
Savart law [Eq. 4.11(a)], 



dB = (μ0/4π) I |dl × r| / r³    (4.12)

Now r² = x² + R². Further, any element of the loop will be perpendicular
to the displacement vector from the element to the axial point. For
example, the element dl in Fig. 4.11 is in the y-z plane, whereas the
displacement vector r from dl to the axial point P is in the x-y plane.
Hence |dl × r| = r dl. Thus,

dB = (μ0/4π) I dl / (x² + R²)    (4.13)

Figure 4.11 Magnetic field on the axis of a current carrying circular loop of radius R. Shown are
the magnetic field dB (due to a line element dl) and its components along and perpendicular to
the axis.


The direction of dB is shown in Fig. 4.11. It is perpendicular to the plane
formed by dl and r. It has an x-component dB_x and a component
perpendicular to the x-axis, dB_⊥. When the components perpendicular to the
x-axis are summed over, they cancel out and we obtain a null result. For
example, the dB_⊥ component due to dl is cancelled by the contribution
due to the diametrically opposite dl element, shown in
Fig. 4.11. Thus, only the x-component survives. The net contribution
along the x-direction can be obtained by integrating dB_x = dB cos θ over
the loop. For Fig. 4.11,

cos θ = R / (x² + R²)^(1/2)    (4.14)

From Eqs. (4.13) and (4.14),

dB_x = μ0 I dl R / [4π (x² + R²)^(3/2)]

The summation of elements dl over the loop yields 2πR, the
circumference of the loop. Thus, the magnetic field at P due to the entire
circular loop is

B = B_x î = μ0 I R² / [2 (x² + R²)^(3/2)] î    (4.15)


As a special case of the above result, we may obtain the field at the 
centre of the loop. Here x = 0, and we obtain, 

B0 = μ0 I / (2R) î    (4.16)
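The summation over current elements described in this section can also be carried out numerically and compared with the closed-form axial result μ0 I R²/[2 (x² + R²)^(3/2)]. The sketch below is one possible way to do this. It assumes the loop lies in the y-z plane with its axis along x, splits the loop into straight segments, and keeps only the x-component. The function names and the sample values are assumptions for this illustration.

```python
import math

MU0 = 4.0 * math.pi * 1.0e-7  # T m/A

def axial_field_formula(current, radius, x):
    """Closed-form field on the axis of a circular loop."""
    return MU0 * current * radius**2 / (2.0 * (x * x + radius * radius) ** 1.5)

def axial_field_numeric(current, radius, x, segments=2000):
    """Sum the x-components of dB from small elements around the loop."""
    dphi = 2.0 * math.pi / segments
    r_cubed = (x * x + radius * radius) ** 1.5
    bx = 0.0
    for k in range(segments):
        phi = (k + 0.5) * dphi
        # Element position in the y-z plane and its tangent vector dl.
        y, z = radius * math.cos(phi), radius * math.sin(phi)
        dl_y, dl_z = -radius * math.sin(phi) * dphi, radius * math.cos(phi) * dphi
        # Displacement from the element to the axial point (x, 0, 0).
        r_y, r_z = -y, -z
        # x-component of dl x r.
        cross_x = dl_y * r_z - dl_z * r_y
        bx += MU0 / (4.0 * math.pi) * current * cross_x / r_cubed
    return bx

b_formula = axial_field_formula(1.0, 0.10, 0.05)
b_numeric = axial_field_numeric(1.0, 0.10, 0.05)
print(b_formula, b_numeric)
```

Setting x = 0 in either function gives the field at the centre of the loop, μ0 I/(2R).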



The magnetic field lines due to a circular wire form closed loops and are 
shown in Fig. 4.12. The direction of the magnetic field is given by 
(another) right-hand thumb rule stated below: 



Curl the palm of your right hand around the circular wire with the fingers 
pointing in the direction of the current. The right-hand thumb gives the 
direction of the magnetic field. 


Figure 4.12 The magnetic field lines for a current loop. The direction of the field is given by the
right-hand thumb rule described in the text. The upper side of the loop may be thought of as the 
north pole and the lower side as the south pole of a magnet. 


Example 4.6 A straight wire carrying a current of 12 A is bent into a
semi-circular arc of radius 2.0 cm as shown in Fig. 4.13(a). Consider
the magnetic field B at the centre of the arc. (a) What is the magnetic
field due to the straight segments? (b) In what way does the contribution
to B from the semicircle differ from that of a circular loop and in what
way does it resemble it? (c) Would your answer be different if the wire
were bent into a semi-circular arc of the same radius but in the
opposite way as shown in Fig. 4.13(b)?

Figure 4.13 (a), (b)

Solution 

(a) dl and r for each element of the straight segments are parallel.
Therefore, dl × r = 0. Straight segments do not contribute to |B|.

(b) For all segments of the semicircular arc, dl × r are all parallel to
each other (into the plane of the paper). All such contributions add up
in magnitude. Hence the direction of B for a semicircular arc is given by
the right-hand rule and the magnitude is half that of a circular loop. Thus
B is 1.9 × 10⁻⁴ T normal to the plane of the paper going into it.

(c) Same magnitude of B but opposite in direction to that in (b). 

Example 4.7 Consider a tightly wound 100 turn coil of radius 10 cm, 
carrying a current of 1 A. What is the magnitude of the magnetic field 
at the centre of the coil? 

Solution Since the coil is tightly wound, we may take each circular 
element to have the same radius R = 10 cm = 0.1 m. The number of 
turns N = 100. The magnitude of the magnetic field is, 



B = μ0 N I / (2R) = 4π × 10⁻⁷ × 10² × 1 / (2 × 10⁻¹) = 2π × 10⁻⁴ = 6.28 × 10⁻⁴ T
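The numbers in Examples 4.6 and 4.7 both follow from the centre-of-loop result μ0 I/(2R), scaled by the number of turns or by the fraction of the circle that is present. A minimal sketch is given below. The function centre_field and its fraction parameter are assumptions introduced here: an arc centred on the field point is taken to contribute in proportion to its length, since every element of such an arc contributes equally.

```python
import math

MU0 = 4.0 * math.pi * 1.0e-7  # T m/A

def centre_field(current, radius, turns=1, fraction=1.0):
    """Field at the centre of a tightly wound coil, or of an arc of a circle.

    fraction = 1.0 is a full circle and 0.5 a semicircle.
    """
    return fraction * turns * MU0 * current / (2.0 * radius)

b_arc = centre_field(12.0, 0.02, fraction=0.5)   # Example 4.6, semicircle
b_coil = centre_field(1.0, 0.10, turns=100)      # Example 4.7, 100-turn coil
print(f"semicircle: {b_arc:.2e} T, coil: {b_coil:.2e} T")
```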







4.7 Ampere’s Circuital Law 

There is an alternative and appealing way in which the Biot-Savart law
may be expressed. Ampere’s circuital law considers an open surface with
a boundary (Fig. 4.14). The surface has current passing through it. We
consider the boundary to be made up of a number of small line elements.
Consider one such element of length dl. We take the value of the
tangential component of the magnetic field, B_t, at this element and
multiply it by the length of that element dl [Note: B_t dl = B·dl]. All such
products are added together. We consider the limit as the lengths of
elements get smaller and their number gets larger. The sum then tends to
an integral. Ampere’s law states that this integral is equal to μ0 times the
total current passing through the surface, i.e.,

∮ B·dl = μ0 I    [4.17(a)]

where I is the total current through the surface. The integral is taken over
the closed loop coinciding with the boundary C of the surface. The
relation above involves a sign-convention, given by the right-hand rule.
Let the fingers of the right-hand be curled in the sense the boundary is
traversed in the loop integral ∮ B·dl. Then the direction of the thumb gives
the sense in which the current I is regarded as positive.



For several applications, a much simplified version of Eq. [4.17(a)] proves 
sufficient. We shall assume that, in such cases, it is possible to choose 
the loop (called an amperian loop) such that at each point of the loop, 
either 

(i) B is tangential to the loop and is a non-zero constant B, or 

(ii) B is normal to the loop, or 

(iii) B vanishes. 

Now, let L be the length (part) of the loop for which B is tangential. Let I_e
be the current enclosed by the loop. Then, Eq. (4.17) reduces to,

BL = μ0 I_e    [4.17(b)]

When there is a system with a symmetry such as for a straight infinite
current-carrying wire in Fig. 4.15, Ampere’s law enables an easy
evaluation of the magnetic field, much the same way Gauss’ law helps in
determination of the electric field. This is exhibited in Example 4.8
below. The boundary of the loop chosen is a circle and the magnetic field is
tangential to the circumference of the circle. The law gives, for the left
hand side of Eq. [4.17(b)], B · 2πr. We find that the magnetic field at a
distance r outside the wire is tangential and given by

B × 2πr = μ0 I,

B = μ0 I / (2πr)    (4.18)

Andre Ampere (1775-1836) Andre Marie Ampere was a French
physicist, mathematician and chemist who founded the science of
electrodynamics. Ampere was a child prodigy who mastered
advanced mathematics by the age of 12. Ampere grasped the
significance of Oersted’s discovery. He carried out a large series of
experiments to explore the relationship between current electricity
and magnetism. These investigations culminated in 1827 with the
publication of the ‘Mathematical Theory of Electrodynamic
Phenomena Deduced Solely from Experiments’. He hypothesised that all
magnetic phenomena are due to circulating electric currents. Ampere
was humble and absent-minded. He once forgot an invitation to dine
with the Emperor Napoleon. He died of pneumonia at the age of 61.
His gravestone bears the epitaph: Tandem Felix (Happy at last).

The above result for the infinite wire is interesting from several points of
view.



(i) It implies that the field at every point on a circle of radius r, (with the 
wire along the axis), is same in magnitude. In other words, the magnetic 
field possesses what is called a cylindrical symmetry. The field that 
normally can depend on three coordinates depends only on one: r. 
Whenever there is symmetry, the solutions simplify. 

(ii) The field direction at any point on this circle is tangential to it. Thus, 
the lines of constant magnitude of magnetic field form concentric circles. 
Notice now, in Fig. 4.1(c), the iron filings form concentric circles. These 
lines called magnetic field lines form closed loops. This is unlike the 
electrostatic field lines which originate from positive charges and end at 
negative charges. The expression for the magnetic field of a straight wire 
provides a theoretical justification to Oersted’s experiments. 

(iii) Another interesting point to note is that even though the wire is 
infinite, the field due to it at a non-zero distance is not infinite. It tends to 
blow up only when we come very close to the wire. The field is directly 
proportional to the current and inversely proportional to the distance from 
the (infinitely long) current source. 

(iv) There exists a simple rule to determine the direction of the magnetic 
field due to a long wire. This rule, called the right-hand rule*, is: 

Grasp the wire in your right hand with your extended thumb pointing in 
the direction of the current. Your fingers will curl around in the direction of 
the magnetic field. 
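Ampere’s circuital law can be tested numerically using the straight-wire field μ0 I/(2πr). The sketch below adds up B·dl around a square loop, once with the wire passing through the loop and once with the wire outside it. It assumes an infinitely long wire along the z-axis through the origin. The function names, the loop positions and the step count are illustrative assumptions, and the integral is approximated by a simple midpoint sum.

```python
import math

MU0 = 4.0 * math.pi * 1.0e-7  # T m/A

def field_of_wire(current, x, y):
    """(Bx, By) of a long wire along z through the origin, current along +z."""
    k = MU0 * current / (2.0 * math.pi * (x * x + y * y))
    return (-k * y, k * x)  # tangential, anticlockwise seen from +z

def circulation_square(current, cx, cy, half, steps=2000):
    """Midpoint-rule sum of B.dl anticlockwise around a square loop."""
    corners = [(cx - half, cy - half), (cx + half, cy - half),
               (cx + half, cy + half), (cx - half, cy + half)]
    total = 0.0
    for i in range(4):
        (x0, y0), (x1, y1) = corners[i], corners[(i + 1) % 4]
        dx, dy = (x1 - x0) / steps, (y1 - y0) / steps
        for k in range(steps):
            bx, by = field_of_wire(current, x0 + (k + 0.5) * dx, y0 + (k + 0.5) * dy)
            total += bx * dx + by * dy
    return total

I = 5.0  # A, assumed
enclosing = circulation_square(I, 0.3, -0.2, 1.0)   # wire inside the loop
excluding = circulation_square(I, 3.0, 0.0, 1.0)    # wire outside the loop
print(enclosing / (MU0 * I), excluding / (MU0 * I))
```

The first ratio comes out close to 1 and the second close to 0, even though the loop is neither circular nor centred on the wire.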

Ampere’s circuital law is not new in content from the Biot-Savart law. Both
relate the magnetic field and the current, and both express the same
physical consequences of a steady electrical current. Ampere’s law is to
the Biot-Savart law what Gauss’s law is to Coulomb’s law. Both Ampere’s
and Gauss’s laws relate a physical quantity on the periphery or boundary
(magnetic or electric field) to another physical quantity, namely, the
source, in the interior (current or charge). We also note that Ampere’s
circuital law holds for steady currents which do not fluctuate with time.
The following example will help us understand what is meant by the term
enclosed current.


Example 4.8 Figure 4.15 shows a long straight wire of a circular
cross-section (radius a) carrying steady current I. The current I is
uniformly distributed across this cross-section. Calculate the
magnetic field in the region r < a and r > a.
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Figure 4.15 


Solution (a) Consider the case r > a. The Amperian loop, labelled 2,
is a circle concentric with the cross-section. For this loop,

L = 2πr

I_e = Current enclosed by the loop = I

The result is the familiar expression for a long straight wire

B (2πr) = μ0 I

B = μ0 I / (2πr)    [4.19(a)]

B ∝ 1/r    (r > a)

(b) Consider the case r < a. The Amperian loop is a circle labelled 1.
For this loop, taking the radius of the circle to be r,

L = 2πr

Now the current enclosed I_e is not I, but is less than this value.
Since the current distribution is uniform, the current enclosed is,

I_e = I (πr²/πa²) = I r²/a²

Using Ampere’s law, B (2πr) = μ0 I r²/a²

B = [μ0 I / (2πa²)] r    [4.19(b)]

B ∝ r    (r < a)


* Note that there are two distinct right-hand rules: One which gives
the direction of B on the axis of current-loop and the other which
gives direction of B for a straight conducting wire. Fingers and thumb
play different roles in the two.
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Figure 4.16 












Figure 4.16 shows a plot of the magnitude of B with distance r from
the centre of the wire. The direction of the field is tangential to the 
respective circular loop (1 or 2) and given by the right-hand rule 
described earlier in this section. 

This example possesses the required symmetry so that Ampere’s law 
can be applied readily. 
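The two regimes of Example 4.8 can be combined into a single function of r. The sketch below is an illustration under the assumptions of the example (a long straight wire with uniformly distributed current). The name wire_field and the sample values I = 10 A and a = 1 cm are chosen here and are not part of the example.

```python
import math

MU0 = 4.0 * math.pi * 1.0e-7  # T m/A

def wire_field(current, a, r):
    """Magnitude of B at distance r from the axis of a wire of radius a."""
    if r <= 0.0:
        raise ValueError("r must be positive")
    if r < a:
        # Inside: only the fraction r^2/a^2 of the current is enclosed.
        return MU0 * current * r / (2.0 * math.pi * a * a)
    # Outside: the whole current is enclosed.
    return MU0 * current / (2.0 * math.pi * r)

I, a = 10.0, 0.01  # assumed sample values
for r in (0.0025, 0.005, 0.01, 0.02, 0.04):
    print(f"r = {r:.4f} m, B = {wire_field(I, a, r):.2e} T")
```

The field rises linearly up to the surface of the wire, where the two expressions agree, and falls off as 1/r beyond it.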

It should be noted that while Ampere’s circuital law holds for any loop, it
may not always facilitate an evaluation of the magnetic field in every
case. For example, for the case of the circular loop discussed in Section
4.6, it cannot be applied to extract the simple expression
B = μ0 I/2R [Eq. (4.16)] for the field at the centre of the loop. However,
there exists a large number of situations of high symmetry where the law
can be conveniently applied. We shall use it in the next section to
calculate the magnetic field produced by two commonly used and very
useful magnetic systems: the solenoid and the toroid.

4.8 The Solenoid and the Toroid 

The solenoid and the toroid are two pieces of equipment which generate 
magnetic fields. The television uses the solenoid to generate magnetic 
fields needed. The synchrotron uses a combination of both to generate 
the high magnetic fields required. In both, solenoid and toroid, we come 
across a situation of high symmetry where Ampere’s law can be 
conveniently applied. 

4.8.1 The solenoid 

We shall discuss a long solenoid. By long solenoid we mean that the
solenoid’s length is large compared to its radius. It consists of a long wire
wound in the form of a helix where the neighbouring turns are closely
spaced. So each turn can be regarded as a circular loop. The net
magnetic field is the vector sum of the fields due to all the turns.
Enamelled wires are used for winding so that turns are insulated from
each other.

Figure 4.17 displays the magnetic field lines for a finite solenoid. We 
show a section of this solenoid in an enlarged manner in Fig. 4.17(a). 
Figure 4.17(b) shows the entire finite solenoid with its magnetic field. In 
Fig. 4.17(a), it is clear from the circular loops that the field between two 
neighbouring turns vanishes. In Fig. 4.17(b), we see that the field at the 
interior mid-point P is uniform, strong and along the axis of the solenoid. 
The field at the exterior mid-point Q is weak and moreover is along the 
axis of the solenoid with no perpendicular or normal component. As the 
solenoid is made longer it appears like a long cylindrical metal sheet. 


Figure 4.18 represents this idealised picture. The field outside the 
solenoid approaches zero. We shall assume that the field outside is zero. 
The field inside becomes everywhere parallel to the axis. 



Figure 4.17 (a) The magnetic field due to a section of the solenoid which has been stretched out
for clarity. Only the exterior semi-circular part is shown. Notice how the circular loops between
neighbouring turns tend to cancel. (b) The magnetic field of a finite solenoid.

Consider a rectangular Amperian loop abcd. Along cd the field is zero as
argued above. Along transverse sections bc and ad, the field component
is zero. Thus, these two sections make no contribution. Let the field along
ab be B. Thus, the relevant length of the Amperian loop is L = h.

Let n be the number of turns per unit length, then the total number of
turns is nh. The enclosed current is I_e = I (n h), where I is the current in
the solenoid. From Ampere’s circuital law [Eq. 4.17(b)]

BL = μ0 I_e,  B h = μ0 I (n h)

B = μ0 n I    (4.20)



Figure 4.18 The magnetic field of a very long solenoid. We consider a rectangular Amperian loop
abcd to determine the field.

The direction of the field is given by the right-hand rule. The solenoid is 
commonly used to obtain a uniform magnetic field. We shall see in the 
next chapter that a large field is possible by inserting a soft iron core 
inside the solenoid. 

4.8.2 The toroid 

The toroid is a hollow circular ring on which a large number of turns of a
wire are closely wound. It can be viewed as a solenoid which has been
bent into a circular shape to close on itself. It is shown in Fig. 4.19(a)
carrying a current I. We shall see that the magnetic field in the open
space inside (point P) and exterior to the toroid (point Q) is zero. The field
B inside the toroid is constant in magnitude for the ideal toroid of closely
wound turns.


Figure 4.19 (a) A toroid carrying a current I. (b) A sectional view of the toroid. The magnetic field
can be obtained at an arbitrary distance r from the centre O of the toroid by Ampere’s circuital law.
The dashed lines labelled 1, 2 and 3 are three circular Amperian loops.

Figure 4.19(b) shows a sectional view of the toroid. The direction of the
magnetic field inside is clockwise as per the right-hand thumb rule for
circular loops. Three circular Amperian loops 1, 2 and 3 are shown by
dashed lines. By symmetry, the magnetic field should be tangential to
each of them and constant in magnitude for a given loop. The circular
areas bounded by loops 2 and 3 both cut the toroid, so that each turn of
current carrying wire is cut once by the loop 2 and twice by the loop 3.

Let the magnetic field along loop 1 be B1 in magnitude. Then in Ampere’s
circuital law [Eq. 4.17(a)], L = 2π r1.

However, the loop encloses no current, so I_e = 0. Thus,

B1 (2π r1) = μ0 (0),  B1 = 0

Thus, the magnetic field at any point P in the open space inside the toroid 
is zero. 

We shall now show that the magnetic field at Q is likewise zero. Let the
magnetic field along loop 3 be B3. Once again from Ampere’s law L = 2π r3.
However, from the sectional cut, we see that the current coming out of
the plane of the paper is cancelled exactly by the current going into it.
Thus, I_e = 0, and B3 = 0.

Let the magnetic field inside the toroid be B. We shall now consider the
magnetic field at S. Once again we employ Ampere’s law in the form of
Eq. [4.17(a)]. We find, L = 2π r.

The current enclosed I_e is (for N turns of toroidal coil) N I.

B (2π r) = μ0 N I

B = μ0 N I / (2π r)    (4.21)


We shall now compare the two results: for a toroid and solenoid. We
re-express Eq. (4.21) to make the comparison easier with the solenoid
result given in Eq. (4.20). Let r be the average radius of the toroid and n
be the number of turns per unit length. Then

N = 2π r n = (average) perimeter of the toroid
× number of turns per unit length

and thus,

B = μ0 n I,    (4.22)

i.e., the result for the solenoid!

In an ideal toroid the coils are circular. In reality the turns of the toroidal 
coil form a helix and there is always a small magnetic field external to the 
toroid. 
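The agreement between the toroid and solenoid results can be checked with a few lines of code. The sketch below treats both as ideal (closely wound, with no field leakage). The function names and the toroid values N = 1000, I = 2 A and r = 0.25 m are assumptions made for this illustration.

```python
import math

MU0 = 4.0 * math.pi * 1.0e-7  # T m/A

def solenoid_field(turns_per_length, current):
    """Field inside a long ideal solenoid, B = mu0 n I."""
    return MU0 * turns_per_length * current

def toroid_field(total_turns, current, radius):
    """Field inside an ideal toroid at distance radius from its centre."""
    return MU0 * total_turns * current / (2.0 * math.pi * radius)

N, I, r = 1000, 2.0, 0.25          # assumed toroid parameters
n = N / (2.0 * math.pi * r)        # turns per unit length along the mean perimeter
b_toroid = toroid_field(N, I, r)
b_equivalent = solenoid_field(n, I)
print(f"toroid: {b_toroid:.3e} T, equivalent solenoid: {b_equivalent:.3e} T")
```

For a real toroid the field varies as 1/r across the winding, so the two expressions agree exactly only at the radius used to define n.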


Magnetic confinement 

We have seen in Section 4.3 (see also the box on helical motion of
charged particles earlier in this chapter) that orbits of charged
particles are helical. If the magnetic field is non-uniform, but does
not change much during one circular orbit,
then the radius of the helix will decrease as it enters stronger
magnetic field and the radius will increase when it enters weaker 
magnetic fields. We consider two solenoids at a distance from each 
other, enclosed in an evacuated container (see figure below where 
we have not shown the container). Charged particles moving in the 
region between the two solenoids will start with a small radius. The 
radius will increase as field decreases and the radius will decrease 
again as field due to the second solenoid takes over. The solenoids 
act as a mirror or reflector. [See the direction of F as the particle
approaches coil 2 in the figure. It has a horizontal component against
the forward motion.] This makes the particles turn back when they
approach the solenoid. Such an arrangement will act like a magnetic
bottle or magnetic container. The particles will never touch the sides
of the container. Such magnetic bottles are of great use in confining 
the high energy plasma in fusion experiments. The plasma will 
destroy any other form of material container because of its high 
temperature. Another useful container is a toroid. Toroids are 
expected to play a key role in the tokamak, a device for plasma
confinement in fusion power reactors. There is an international 
collaboration called the International Thermonuclear Experimental 
Reactor (ITER), being set up in France, for achieving controlled 
fusion, of which India is a collaborating nation. For details of ITER 
collaboration and the project, you may visit http://www.iter.org.

Example 4.9 A solenoid of length 0.5 m has a radius of 1 cm and is
made up of 500 turns. It carries a current of 5 A. What is the
magnitude of the magnetic field inside the solenoid?

Solution The number of turns per unit length is,

$n = \frac{500}{0.5} = 1000$ turns/m

The length l = 0.5 m and radius r = 0.01 m. Thus, l/r = 50, i.e., $l \gg r$.
Hence, we can use the long solenoid formula, namely, Eq. (4.20)

$B = \mu_0 n I$

$= 4\pi \times 10^{-7} \times 10^{3} \times 5$

$= 6.28 \times 10^{-3}$ T
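
The same calculation can be written as a short script. This is a sketch only: the function name and the aspect-ratio threshold `min_aspect` used to decide whether the solenoid counts as "long" are assumptions made for illustration, not part of the text.

```python
import math

MU_0 = 4 * math.pi * 1e-7  # permeability of free space, T m / A


def long_solenoid_field(turns, length, radius, current, min_aspect=10.0):
    """Return B = mu_0 n I for a solenoid that is long compared with its radius.

    min_aspect is an assumed (hypothetical) cut-off for length/radius below
    which the long-solenoid formula is not trusted.
    """
    if length / radius < min_aspect:
        raise ValueError("solenoid is too short for the long-solenoid formula")
    n = turns / length  # turns per unit length
    return MU_0 * n * current


# Values of Example 4.9: 500 turns, 0.5 m long, radius 1 cm, current 5 A.
b_inside = long_solenoid_field(turns=500, length=0.5, radius=0.01, current=5.0)
print(f"B = {b_inside:.3e} T")
```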


4.9 Force between Two Parallel Currents, the Ampere

We have learnt that there exists a magnetic field due to a conductor 
carrying a current which obeys the Biot-Savart law. Further, we have 
learnt that an external magnetic field will exert a force on a current- 
carrying conductor. This follows from the Lorentz force formula. Thus, it is 
logical to expect that two current-carrying conductors placed near each 
other will exert (magnetic) forces on each other. In the period 1820-25, 
Ampere studied the nature of this magnetic force and its dependence on 
the magnitude of the current, on the shape and size of the conductors, as 
well as, the distances between the conductors. In this section, we shall 
take the simple example of two parallel current-carrying conductors,
which will perhaps help us to appreciate Ampere’s painstaking work. 

Figure 4.20 shows two long parallel conductors a and b separated by a
distance d and carrying (parallel) currents $I_a$ and $I_b$, respectively.
The conductor ‘a’ produces the same magnetic field $B_a$ at all points
along the conductor ‘b’. The right-hand rule tells us that the direction of
this field is downwards (when the conductors are placed horizontally). Its
magnitude is given by Eq. [4.19(a)] or from Ampere’s circuital law,

$B_a = \frac{\mu_0 I_a}{2\pi d}$

Figure 4.20 Two long straight parallel conductors carrying steady currents $I_a$ and $I_b$ and separated
by a distance d. $B_a$ is the magnetic field set up by conductor ‘a’ at conductor ‘b’.


The conductor ‘b’ carrying a current $I_b$ will experience a sideways force
due to the field $B_a$. The direction of this force is towards the
conductor ‘a’ (Verify this). We label this force as $F_{ba}$, the force on a
segment L of ‘b’ due to ‘a’. The magnitude of this force is given by
Eq. (4.4),

$F_{ba} = I_b L B_a = \frac{\mu_0 I_a I_b}{2\pi d} L$    (4.23)


It is of course possible to compute the force on ‘a’ due to ‘b’. From
considerations similar to above we can find the force $F_{ab}$, on a segment
of length L of ‘a’ due to the current in ‘b’. It is equal in magnitude to
$F_{ba}$, and directed towards ‘b’. Thus,

$\mathbf{F}_{ba} = -\mathbf{F}_{ab}$    (4.24)


Note that this is consistent with Newton’s third Law. Thus, at least for 
parallel conductors and steady currents, we have shown that the 
Biot-Savart law and the Lorentz force yield results in accordance with 
Newton’s third Law*. 

We have seen from above that currents flowing in the same direction 
attract each other. One can show that oppositely directed currents repel 
each other. Thus, 

Parallel currents attract, and antiparallel currents repel. 

This rule is the opposite of what we find in electrostatics. Like (same 
sign) charges repel each other, but like (parallel) currents attract each 
other. 


Let $f_{ba}$ represent the magnitude of the force $F_{ba}$ per unit length.
Then, from Eq. (4.23),

$f_{ba} = \frac{\mu_0 I_a I_b}{2\pi d}$    (4.25)


The above expression is used to define the ampere (A), which is one of 
the seven SI base units. 

The ampere is the value of that steady current which, when maintained in
each of the two very long, straight, parallel conductors of negligible
cross-section, and placed one metre apart in vacuum, would produce on each
of these conductors a force equal to $2 \times 10^{-7}$ newtons per metre of
length.

This definition of the ampere was adopted in 1946. It is a theoretical 
definition. In practice, one must eliminate the effect of the earth’s 
magnetic field and substitute very long wires by multiturn coils of 
appropriate geometries. An instrument called the current balance is used 
to measure this mechanical force. 
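
Equation (4.25) and the definition of the ampere can be checked with a few lines of code. This is a minimal sketch; the function name and the sign convention (currents taken as signed numbers, with a positive result meaning attraction) are assumptions introduced here for illustration.

```python
import math

MU_0 = 4 * math.pi * 1e-7  # permeability of free space, T m / A


def force_per_unit_length(i_a, i_b, d):
    """Eq. (4.25): f = mu_0 I_a I_b / (2 pi d), in N/m.

    Currents are signed (assumed convention): the same sign means parallel
    currents. A positive result then means attraction, a negative one repulsion.
    """
    return MU_0 * i_a * i_b / (2 * math.pi * d)


# Two wires carrying 1 A each, one metre apart: the defining case of the ampere.
f_definition = force_per_unit_length(1.0, 1.0, 1.0)

# Antiparallel currents of the same magnitude: same size of force, opposite sign.
f_antiparallel = force_per_unit_length(1.0, -1.0, 1.0)

print(f"parallel, 1 A at 1 m:     {f_definition:.1e} N/m")
print(f"antiparallel, 1 A at 1 m: {f_antiparallel:.1e} N/m")
```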

The SI unit of charge, namely, the coulomb, can now be defined in terms 
of the ampere. 

When a steady current of 1 A is set up in a conductor, the quantity of
charge that flows through its cross-section in 1 s is one coulomb (1 C).

* It turns out that when we have time-dependent currents and/or charges 
in motion, Newton’s third law may not hold for forces between charges 
and/or conductors. An essential consequence of the Newton’s third law in 
mechanics is conservation of momentum of an isolated system. This, 
however, holds even for the case of time-dependent situations with 
electromagnetic fields, provided the momentum carried by fields is also 
taken into account. 


Roget’s spiral for attraction between parallel currents 

Magnetic effects are generally smaller than electric effects. As a
consequence, the force between currents is rather small, because of
the smallness of the factor $\mu_0$. Hence it is difficult to demonstrate
attraction or repulsion between currents. Thus, for 5 A current in each
wire at a separation of 1 cm, the force per metre would be
$5 \times 10^{-4}$ N, which is about 50 mg weight. It would be like pulling
a wire by a string going over a pulley to which a 50 mg weight is attached.
The displacement of the wire would be quite unnoticeable.

With the use of a soft spring, we can increase the effective length of 
the parallel current and by using mercury, we can make the 
displacement of even a few mm observable very dramatically. You 
will also need a constant-current supply giving a constant current of 
about 5 A. 



Take a soft spring whose natural period of oscillations is about 0.5 to
1 s. Hang it vertically and attach a pointed tip to its lower end, as
shown in the figure here. Take some mercury in a dish and adjust the 
spring such that the tip is just above the mercury surface. Take the 
DC current source, connect one of its terminals to the upper end of 
the spring, and dip the other terminal in mercury. If the tip of the 
spring touches mercury, the circuit is completed through mercury. 

Let the DC source be put off to begin with. Let the tip be adjusted so 
that it just touches the mercury surface. Switch on the constant 
current supply, and watch the fascinating outcome. The spring 
shrinks with a jerk, the tip comes out of mercury (just by a mm or so), 
the circuit is broken, the current stops, the spring relaxes and tries to 
come back to its original position, the tip again touches mercury 
establishing a current in the circuit, and the cycle continues with tick, 
tick, tick, ... In the beginning, you may require some small adjustments
to get a good effect.

Keep your face away from mercury vapour as it is poisonous. Do not 
inhale mercury vapour for long. 


Example 4.10 The horizontal component of the earth’s magnetic field
at a certain place is $3.0 \times 10^{-5}$ T and the direction of the field
is from the geographic south to the geographic north. A very long straight
conductor is carrying a steady current of 1 A. What is the force per
unit length on it when it is placed on a horizontal table and the
direction of the current is (a) east to west; (b) south to north?

Solution $\mathbf{F} = I \mathbf{l} \times \mathbf{B}$

$F = I l B \sin\theta$

The force per unit length is
$f = F/l = I B \sin\theta$

(a) When the current is flowing from east to west,

$\theta = 90^\circ$

Hence,
$f = I B$

$= 1 \times 3 \times 10^{-5} = 3 \times 10^{-5}$ N m$^{-1}$

This is larger than the value $2 \times 10^{-7}$ N m$^{-1}$ quoted in the
definition of the ampere. Hence it is important to eliminate the effect of
the earth’s magnetic field and other stray fields while standardising the
ampere.

The direction of the force is downwards. This direction may be
obtained by the directional property of cross product of vectors.

(b) When the current is flowing from south to north,
$\theta = 0^\circ$
$f = 0$

Hence there is no force on the conductor.
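
The directions in this example can also be obtained by evaluating the cross product explicitly. The sketch below assumes a coordinate system that is not in the text: x points east, y points north and z points vertically up. The helper names are likewise illustrative.

```python
def cross(a, b):
    """Cross product of two 3-component vectors given as tuples."""
    return (
        a[1] * b[2] - a[2] * b[1],
        a[2] * b[0] - a[0] * b[2],
        a[0] * b[1] - a[1] * b[0],
    )


def force_per_length_vector(current, direction, field):
    """f = I (l_hat x B), where l_hat is a unit vector along the current."""
    return tuple(current * c for c in cross(direction, field))


# Assumed axes: x = east, y = north, z = up.
B_EARTH = (0.0, 3.0e-5, 0.0)   # horizontal field pointing south to north, in T
WEST = (-1, 0, 0)              # current flowing east to west
NORTH = (0, 1, 0)              # current flowing south to north

f_a = force_per_length_vector(1.0, WEST, B_EARTH)
f_b = force_per_length_vector(1.0, NORTH, B_EARTH)
print("(a)", f_a)  # negative z-component: force directed downwards
print("(b)", f_b)  # all components zero: no force
```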


4.10 Torque on Current Loop, Magnetic Dipole 

4.10.1 Torque on a rectangular current loop in a uniform 
magnetic field 

We now show that a rectangular loop carrying a steady current I and 
placed in a uniform magnetic field experiences a torque. It does not 
experience a net force. This behaviour is analogous to that of electric 
dipole in a uniform electric field 
(Section 1.12). 

We first consider the simple case when the rectangular loop is placed 
such that the uniform magnetic field B is in the plane of the loop. This is 
illustrated in Fig. 4.21(a). 

The field exerts no force on the two arms AD and BC of the loop. It is
perpendicular to the arm AB of the loop and exerts a force $F_1$ on it which
is directed into the plane of the loop. Its magnitude is,

$F_1 = I b B$

Similarly, it exerts a force $F_2$ on the arm CD and $F_2$ is directed out
of the plane of the paper.

$F_2 = I b B = F_1$

Figure 4.21 (a) A rectangular current-carrying coil in uniform magnetic field. The magnetic
moment m points downwards. The torque $\tau$ is along the axis and tends to rotate the coil
anticlockwise. (b) The couple acting on the coil.

Thus, the net force on the loop is zero. There is a torque on the loop due
to the pair of forces $F_1$ and $F_2$. Figure 4.21(b) shows a view of the
loop from the AD end. It shows that the torque on the loop tends to rotate
it anticlockwise. This torque is (in magnitude),

$\tau = F_1 \frac{a}{2} + F_2 \frac{a}{2} = I b B \frac{a}{2} + I b B \frac{a}{2} = I (ab) B = I A B$    (4.26)

where A = ab is the area of the rectangle.

We next consider the case when the plane of the loop is not along the
magnetic field, but makes an angle with it. We take the angle between
the field and the normal to the coil to be $\theta$ (the previous case
corresponds to $\theta = \pi/2$). Figure 4.22 illustrates this general case.

The forces on the arms BC and DA are equal, opposite, and act along the 
axis of the coil, which connects the centres of mass of BC and DA. Being 
collinear along the axis they cancel each other, resulting in no net force 
or torque. The forces on arms AB and CD are $F_1$ and $F_2$. They too are
equal and opposite, with magnitude,

$F_1 = F_2 = I b B$

But they are not collinear! This results in a couple as before. The torque 
is, however, less than the earlier case when plane of loop was along the 
magnetic field. This is because the perpendicular distance between the 
forces of the couple has decreased. Figure 4.22(b) is a view of the 
arrangement from the AD end and it illustrates these two forces 
constituting a couple. The magnitude of the torque on the loop is,

$\tau = F_1 \frac{a}{2} \sin\theta + F_2 \frac{a}{2} \sin\theta$

$= I ab B \sin\theta$

$= I A B \sin\theta$    (4.27)

Figure 4.22 (a) The area vector of the loop ABCD makes an arbitrary angle $\theta$ with the magnetic
field. (b) Top view of the loop. The forces $F_1$ and $F_2$ acting on the arms AB and CD are indicated.

As $\theta \rightarrow 0$, the perpendicular distance between the forces of
the couple also approaches zero. This makes the forces collinear and the
net force and torque zero. The torques in Eqs. (4.26) and (4.27) can be
expressed as vector product of the magnetic moment of the coil and the
magnetic field. We define the magnetic moment of the current loop as,

m = I A (4.28) 

where the direction of the area vector A is given by the right-hand thumb
rule and is directed into the plane of the paper in Fig. 4.21. Then as the
angle between m and B is $\theta$, Eqs. (4.26) and (4.27) can be expressed
by one expression

$\boldsymbol{\tau} = \mathbf{m} \times \mathbf{B}$    (4.29)

This is analogous to the electrostatic case (electric dipole of dipole
moment $\mathbf{p}_e$ in an electric field E).

$\boldsymbol{\tau} = \mathbf{p}_e \times \mathbf{E}$

As is clear from Eq. (4.28), the dimensions of the magnetic moment are
[A][L$^2$] and its unit is A m$^2$.

From Eq. (4.29), we see that the torque $\boldsymbol{\tau}$ vanishes when m is either
parallel or antiparallel to the magnetic field B. This indicates a state of
equilibrium as there is no torque on the coil (this also applies to any 
object with a magnetic moment m). When m and B are parallel the 
equilibrium is a stable one. Any small rotation of the coil produces a 
torque which brings it back to its original position. When they are 
antiparallel, the equilibrium is unstable as any rotation produces a torque 
which increases with the amount of rotation. The presence of this torque 
is also the reason why a small magnet or any magnetic dipole aligns itself 
with the external magnetic field. 

If the loop has N closely wound turns, the expression for torque, Eq. 
(4.29), still holds, with 



m = N I A (4.30) 


Example 4.11 A 100 turn closely wound circular coil of radius 10 cm 
carries a current of 3.2 A. (a) What is the field at the centre of the 
coil? (b) What is the magnetic moment of this coil? 

The coil is placed in a vertical plane and is free to rotate about a 
horizontal axis which coincides with its diameter. A uniform magnetic 
field of 2 T in the horizontal direction exists such that initially the axis
of the coil is in the direction of the field. The coil rotates through an 
angle of 90° under the influence of the magnetic field. 

(c) What are the magnitudes of the torques on the coil in the initial 
and final position? (d) What is the angular speed acquired by the coil 
when it has rotated by 90°? The moment of inertia of the coil is
0.1 kg m$^2$.

Solution 

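(a) From Eq. (4.16)

$B = \frac{\mu_0 N I}{2R}$

Here, N = 100; I = 3.2 A, and R = 0.1 m. Hence,

$B = \frac{4\pi \times 10^{-7} \times 10^{2} \times 3.2}{2 \times 10^{-1}} = \frac{4 \times 10^{-5} \times 10}{2 \times 10^{-1}}$ (using $\pi \times 3.2 = 10$)

$= 2 \times 10^{-3}$ T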

The direction is given by the right-hand thumb rule. 

(b) The magnetic moment is given by Eq. (4.30),

$m = N I A = N I \pi r^2 = 100 \times 3.2 \times 3.14 \times 10^{-2} = 10$ A m$^2$

The direction is once again given by the right-hand thumb rule.

(c) $\tau = |\mathbf{m} \times \mathbf{B}|$ [from Eq. (4.29)]

$= m B \sin\theta$

Initially, $\theta = 0$. Thus, initial torque $\tau_i = 0$. Finally, $\theta = \pi/2$ (or 90°).
Thus, final torque $\tau_f = m B = 10 \times 2 = 20$ N m.

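(d) From Newton’s second law,

$\mathcal{I} \frac{d\omega}{dt} = m B \sin\theta$

where $\mathcal{I}$ is the moment of inertia of the coil. From chain rule,

$\frac{d\omega}{dt} = \frac{d\omega}{d\theta} \frac{d\theta}{dt} = \frac{d\omega}{d\theta} \omega$

Using this,

$\mathcal{I} \omega \, d\omega = m B \sin\theta \, d\theta$

Integrating from $\theta = 0$ to $\theta = \pi/2$,

$\mathcal{I} \int_0^{\omega_f} \omega \, d\omega = m B \int_0^{\pi/2} \sin\theta \, d\theta$

$\mathcal{I} \frac{\omega_f^2}{2} = -m B \cos\theta \Big|_0^{\pi/2} = m B$

$\omega_f = \left( \frac{2 m B}{\mathcal{I}} \right)^{1/2} = \left( \frac{2 \times 20}{10^{-1}} \right)^{1/2} = 20$ s$^{-1}$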
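
The four parts of this example can be reproduced numerically. The script below is a sketch with illustrative variable names; it uses the full value of $\pi$ instead of the rounding $\pi \times 3.2 = 10$ adopted in the solution, so its results differ from the rounded answers by about half a per cent.

```python
import math

MU_0 = 4 * math.pi * 1e-7  # permeability of free space, T m / A

# Data of Example 4.11.
N_TURNS = 100        # number of turns
CURRENT = 3.2        # A
RADIUS = 0.10        # m
B_EXTERNAL = 2.0     # T
INERTIA = 0.1        # kg m^2, moment of inertia of the coil

# (a) Field at the centre of the coil, B = mu_0 N I / (2 R).
b_centre = MU_0 * N_TURNS * CURRENT / (2 * RADIUS)

# (b) Magnetic moment, m = N I A with A = pi R^2.
moment = N_TURNS * CURRENT * math.pi * RADIUS**2


def torque(m, b, theta):
    """(c) Magnitude of the torque, tau = m B sin(theta)."""
    return m * b * math.sin(theta)


tau_initial = torque(moment, B_EXTERNAL, 0.0)
tau_final = torque(moment, B_EXTERNAL, math.pi / 2)

# (d) Angular speed after a 90 degree rotation, omega_f = sqrt(2 m B / I).
omega_final = math.sqrt(2 * moment * B_EXTERNAL / INERTIA)

print(f"(a) B at centre = {b_centre:.3e} T")
print(f"(b) m           = {moment:.2f} A m^2")
print(f"(c) torques     = {tau_initial:.1f} N m, {tau_final:.1f} N m")
print(f"(d) omega_f     = {omega_final:.1f} rad/s")
```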
Example 4.12

(a) A current-carrying circular loop lies on a smooth horizontal plane.
Can a uniform magnetic field be set up in such a manner that the
loop turns around itself (i.e., turns about the vertical axis)?

(b) A current-carrying circular loop is located in a uniform external 
magnetic field. If the loop is free to turn, what is its orientation of 
stable equilibrium? Show that in this orientation, the flux of the total 
field (external field + field produced by the loop) is maximum. 

(c) A loop of irregular shape carrying current is located in an external 
magnetic field. If the wire is flexible, why does it change to a circular 
shape? 

Solution 

(a) No, because that would require $\boldsymbol{\tau}$ to be in the vertical direction. But
$\boldsymbol{\tau} = I \mathbf{A} \times \mathbf{B}$, and since A of the horizontal loop is in the vertical
direction, $\boldsymbol{\tau}$ would be in the plane of the loop for any B.

(b) Orientation of stable equilibrium is one where the area vector A of 
the loop is in the direction of external magnetic field. In this 
orientation, the magnetic field produced by the loop is in the same 
direction as external field, both normal to the plane of the loop, thus 
giving rise to maximum flux of the total field. 

(c) It assumes circular shape with its plane normal to the field to 
maximise flux, since for a given perimeter, a circle encloses greater 
area than any other shape. 


4.10.2 Circular current loop as a magnetic dipole 

In this section, we shall consider the elementary magnetic element: the
current loop. We shall show that the magnetic field (at large distances)
due to current in a circular current loop is very similar in behaviour to the
electric field of an electric dipole. In Section 4.6, we have evaluated the
magnetic field on the axis of a circular loop, of a radius R, carrying a
steady current I. The magnitude of this field is [Eq. (4.15)],

$B = \frac{\mu_0 I R^2}{2 (x^2 + R^2)^{3/2}}$

and its direction is along the axis and given by the right-hand thumb rule
(Fig. 4.12). Here, x is the distance along the axis from the centre of the
loop. For $x \gg R$, we may drop the $R^2$ term in the denominator. Thus,

$B = \frac{\mu_0 I R^2}{2 x^3}$

Note that the area of the loop $A = \pi R^2$. Thus,

$B = \frac{\mu_0 I A}{2\pi x^3}$

As earlier, we define the magnetic moment m to have a magnitude IA,
m = I A. Hence,

$B \simeq \frac{\mu_0 m}{2\pi x^3} = \frac{\mu_0}{4\pi} \frac{2m}{x^3}$    [4.31(a)]

The expression of Eq. [4.31(a)] is very similar to an expression obtained
earlier for the electric field of a dipole. The similarity may be seen if we
substitute,

$\mu_0 \rightarrow 1/\varepsilon_0$

$\mathbf{m} \rightarrow \mathbf{p}_e$ (electrostatic dipole)

$\mathbf{B} \rightarrow \mathbf{E}$ (electrostatic field)

We then obtain,

$E = \frac{2 p_e}{4\pi \varepsilon_0 x^3}$

which is precisely the field for an electric dipole at a point on its axis, 
considered in Chapter 1, Section 1.10 [Eq. (1.20)]. 

It can be shown that the above analogy can be carried further. We had
found in Chapter 1 that the electric field on the perpendicular bisector of
the dipole is given by [See Eq. (1.21)],

$E \simeq \frac{p_e}{4\pi \varepsilon_0 x^3}$

where x is the distance from the dipole. If we replace $p \rightarrow m$ and
$\mu_0 \rightarrow 1/\varepsilon_0$ in the above expression, we obtain the
result for B for a point in the plane of the loop at a distance x from the
centre. For $x \gg R$,

$B \simeq \frac{\mu_0}{4\pi} \frac{m}{x^3}; \quad x \gg R$    [4.31(b)]

The results given by Eqs. [4.31(a)] and [4.31(b)] become exact for a point 
magnetic dipole. 
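
How quickly the dipole form of Eq. [4.31(a)] approaches the exact axial field of Eq. (4.15) can be seen numerically. The following sketch uses assumed illustrative values (a loop of radius 5 cm carrying 1 A) and hypothetical function names; only the two formulas themselves come from the text.

```python
import math

MU_0 = 4 * math.pi * 1e-7  # permeability of free space, T m / A


def axial_field_exact(current, radius, x):
    """Eq. (4.15): B = mu_0 I R^2 / (2 (x^2 + R^2)^(3/2))."""
    return MU_0 * current * radius**2 / (2 * (x**2 + radius**2) ** 1.5)


def axial_field_dipole(current, radius, x):
    """Eq. [4.31(a)]: B = (mu_0 / 4 pi) (2 m / x^3), with m = I pi R^2."""
    m = current * math.pi * radius**2
    return MU_0 / (4 * math.pi) * 2 * m / x**3


def relative_error(current, radius, x):
    """Fractional overestimate made by the dipole approximation."""
    exact = axial_field_exact(current, radius, x)
    return abs(axial_field_dipole(current, radius, x) - exact) / exact


# Assumed illustrative loop: I = 1 A, R = 5 cm.
I_LOOP, R_LOOP = 1.0, 0.05
for ratio in (1, 2, 5, 10, 50):
    err = relative_error(I_LOOP, R_LOOP, ratio * R_LOOP)
    print(f"x = {ratio:>2} R: relative error = {err:.4f}")
```

The error falls steadily as x/R grows, which is the sense in which the results become exact for a point dipole.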

The results obtained above can be shown to apply to any planar loop: a 
planar current loop is equivalent to a magnetic dipole of dipole moment m 
= I A, which is the analogue of electric dipole moment p. Note, however, 
a fundamental difference: an electric dipole is built up of two elementary 
units — the charges (or electric monopoles). In magnetism, a magnetic 
dipole (or a current loop) is the most elementary element. The equivalent 
of electric charges, i.e., magnetic monopoles, are not known to exist. 



We have shown that a current loop (i) produces a magnetic field (see Fig.
4.12) and behaves like a magnetic dipole at large distances, and (ii) is
subject to torque like a magnetic needle. This led Ampere to suggest that
all magnetism is due to circulating currents. This seems to be partly true
and no magnetic monopoles have been seen so far. However, elementary
particles such as an electron or a proton also carry an intrinsic magnetic
moment, not accounted for by circulating currents.

4.10.3 The magnetic dipole moment of a revolving electron 

In Chapter 12 we shall read about the Bohr model of the hydrogen atom. 
You may perhaps have heard of this model which was proposed by the 
Danish physicist Niels Bohr in 1913 and was a stepping stone to a new
kind of mechanics, namely, quantum mechanics. In the Bohr model, the 
electron (a negatively charged particle) revolves around a positively 
charged nucleus much as a planet revolves around the sun. The force in 
the former case is electrostatic (Coulomb force) while it is gravitational for 
the planet-Sun case. We show this Bohr picture of the electron in Fig. 
4.23. 

The electron of charge (-e) ($e = +1.6 \times 10^{-19}$ C) performs uniform
circular motion around a stationary heavy nucleus of charge +Ze. This
constitutes a current I, where,

$I = \frac{e}{T}$    (4.32)

and T is the time period of revolution. Let r be the orbital radius of the
electron, and v the orbital speed. Then,

$T = \frac{2\pi r}{v}$    (4.33)

Substituting in Eq. (4.32), we have $I = ev/2\pi r$.

There will be a magnetic moment, usually denoted by $\mu_l$, associated with
this circulating current. From Eq. (4.28) its magnitude is,
$\mu_l = I \pi r^2 = evr/2$.

Figure 4.23 In the Bohr model of hydrogen-like atoms, the negatively charged electron is revolving
with uniform speed around a centrally placed positively charged (+Z e) nucleus. The uniform
circular motion of the electron constitutes a current. The direction of the magnetic moment is into
the plane of the paper and is indicated separately by ⊗.

The direction of this magnetic moment is into the plane of the paper in
Fig. 4.23. [This follows from the right-hand rule discussed earlier and the
fact that the negatively charged electron is moving anticlockwise, leading
to a clockwise current.] Multiplying and dividing the right-hand side of the
above expression by the electron mass $m_e$, we have,

$\mu_l = \frac{e}{2 m_e} (m_e v r) = \frac{e}{2 m_e} l$    [4.34(a)]

Here, l is the magnitude of the angular momentum of the electron about
the central nucleus (“orbital” angular momentum). Vectorially,

$\boldsymbol{\mu}_l = -\frac{e}{2 m_e} \mathbf{l}$    [4.34(b)]

The negative sign indicates that the angular momentum of the electron is 
opposite in direction to the magnetic moment. Instead of electron with 
charge (-e), if we had taken a particle with charge (+q), the angular 
momentum and magnetic moment would be in the same direction. The 
ratio

$\frac{\mu_l}{l} = \frac{e}{2 m_e}$    (4.35)

is called the gyromagnetic ratio and is a constant. Its value is
$8.8 \times 10^{10}$ C/kg for an electron, which has been verified by
experiments.

The fact that even at an atomic level there is a magnetic moment, 
confirms Ampere’s bold hypothesis of atomic magnetic moments. This 
according to Ampere, would help one to explain the magnetic properties 
of materials. Can one assign a value to this atomic dipole moment? The 
answer is Yes. One can do so within the Bohr model. Bohr hypothesised 
that the angular momentum assumes a discrete set of values, namely,

$l = \frac{n h}{2\pi}$    (4.36)


where n is a natural number, n = 1, 2, 3, .... and h is a constant named
after Max Planck (Planck’s constant) with a value $h = 6.626 \times 10^{-34}$ J s.

This condition of discreteness is called the Bohr quantisation condition. 
We shall discuss it in detail in Chapter 12. Our aim here is merely to use 
it to calculate the elementary dipole moment. Taking the value n = 1, we
have from Eq. (4.34) that,

$(\mu_l)_{min} = \frac{e}{4\pi m_e} h$

$= \frac{1.60 \times 10^{-19} \times 6.63 \times 10^{-34}}{4 \times 3.14 \times 9.11 \times 10^{-31}}$

$= 9.27 \times 10^{-24}$ A m$^2$    (4.37)

where the subscript ‘min’ stands for minimum. This value is called the 
Bohr magneton. 
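
The gyromagnetic ratio and the Bohr magneton follow directly from the constants quoted in this section. The short script below is a sketch that uses those rounded values; the function and constant names are illustrative choices.

```python
import math

E_CHARGE = 1.60e-19       # magnitude of the electron charge, C
PLANCK_H = 6.626e-34      # Planck's constant, J s
ELECTRON_MASS = 9.11e-31  # kg


def orbital_moment(n):
    """Orbital magnetic moment for l = n h / (2 pi): mu_l = n e h / (4 pi m_e)."""
    return n * E_CHARGE * PLANCK_H / (4 * math.pi * ELECTRON_MASS)


gyromagnetic_ratio = E_CHARGE / (2 * ELECTRON_MASS)  # Eq. (4.35), C/kg
bohr_magneton = orbital_moment(1)                    # Eq. (4.37), A m^2

print(f"gyromagnetic ratio = {gyromagnetic_ratio:.2e} C/kg")
print(f"Bohr magneton      = {bohr_magneton:.3e} A m^2")
```

Because the moment is proportional to n, the orbital moment in this model is always a whole-number multiple of the Bohr magneton.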

Any charge in uniform circular motion would have an associated 
magnetic moment given by an expression similar to Eq. (4.34). This 
dipole moment is labelled as the orbital magnetic moment. Hence, the
subscript ‘l’ in $\mu_l$. Besides the orbital moment, the electron has an
intrinsic magnetic moment, which has the same numerical value as given in
Eq. (4.37). It is called the spin magnetic moment. But we hasten to add
that it is not as though the electron is spinning. The electron is an
elementary particle and it does not have an axis to spin around like a top
or our
earth. Nevertheless, it does possess this intrinsic magnetic moment. The
microscopic roots of magnetism in iron and other materials can be traced 
back to this intrinsic spin magnetic moment. 

4.11 The Moving Coil Galvanometer 

Currents and voltages in circuits have been discussed extensively in 
Chapter 3. But how do we measure them? How do we claim that
current in a circuit is 1.5 A or the voltage drop across a resistor is 1.2 V? 
Figure 4.24 exhibits a very useful instrument for this purpose: the moving 
coil galvanometer (MCG). It is a device whose principle can be 
understood on the basis of our discussion in Section 4.10. 

The galvanometer consists of a coil, with many turns, free to rotate about 
a fixed axis (Fig. 4.24), in a uniform radial magnetic field. There is a 
cylindrical soft iron core which not only makes the field radial but also 
increases the strength of the magnetic field. When a current flows 
through the coil, a torque acts on it. This torque is given by Eq. (4.26) to 
be 

$\tau = N I A B$

where the symbols have their usual meaning. Since the field is radial by
design, we have taken $\sin\theta = 1$ in the above expression for the torque.

The magnetic torque NIAB tends to rotate the coil. A spring $S_p$ provides a
counter torque $k\phi$ that balances the magnetic torque NIAB, resulting in a
steady angular deflection $\phi$. In equilibrium

$k\phi = N I A B$

where k is the torsional constant of the spring, i.e., the restoring torque
per unit twist. The deflection $\phi$ is indicated on the scale by a pointer
attached to the spring. We have

$\phi = \left( \frac{N A B}{k} \right) I$    (4.38)


The quantity in brackets is a constant for a given galvanometer. 

The galvanometer can be used in a number of ways. It can be used as a 
detector to check if a current is flowing in the circuit. We have come 
across this usage in the Wheatstone’s bridge arrangement. In this usage 
the neutral position of the pointer (when no current is flowing through the 
galvanometer) is in the middle of the scale and not at the left end as 
shown in Fig.4.24. Depending on the direction of the current, the pointer’s 
deflection is either to the right or the left. 

The galvanometer cannot as such be used as an ammeter to measure 
the value of the current in a given circuit. This is for two reasons: (i) 
Galvanometer is a very sensitive device, it gives a full-scale deflection for
a current of the order of μA. (ii) For measuring currents, the galvanometer
has to be connected in series, and as it has a large resistance, this will
change the value of the current in the circuit. To overcome these
difficulties, one attaches a small resistance $r_s$, called shunt resistance,
in parallel with the galvanometer coil, so that most of the current passes
through the shunt. The resistance of this arrangement is,

$R_G r_s / (R_G + r_s) \simeq r_s$ if $R_G \gg r_s$

Figure 4.24 The moving coil galvanometer. Its elements are described in the text. Depending on
the requirement, this device can be used as a current detector or for measuring the value of the
current (ammeter) or voltage (voltmeter).

If $r_s$ has small value, in relation to the resistance of the rest of the
circuit $R_c$, the effect of introducing the measuring instrument is also
small and negligible. This arrangement is schematically shown in Fig. 4.25.
The scale of this ammeter is calibrated and then graduated to read off the
current value with ease. We define the current sensitivity of the
galvanometer as the deflection per unit current. From Eq. (4.38) this
current sensitivity is,

$\frac{\phi}{I} = \frac{N A B}{k}$    (4.39)


A convenient way for the manufacturer to increase the sensitivity is to 
increase the number of turns N. We choose galvanometers having 
sensitivities of value, required by our experiment. 

The galvanometer can also be used as a voltmeter to measure the 
voltage across a given section of the circuit. For this it must be connected 
in parallel with that section of the circuit. Further, it must draw a very 
small current, otherwise the voltage measurement will disturb the original 
set up by an amount which is very large. Usually we like to keep the 
disturbance due to the measuring device below one per cent. To ensure 
this, a large resistance R is connected in series with the galvanometer. 
This arrangement is schematically depicted in Fig.4.26. Note that the 
resistance of the voltmeter is now, 


$R_G + R \simeq R$ : large

The scale of the voltmeter is calibrated to read off the voltage value with
ease. We define the voltage sensitivity as the deflection per unit voltage.
From Eq. (4.38),

$\frac{\phi}{V} = \left( \frac{N A B}{k} \right) \frac{I}{V} = \left( \frac{N A B}{k} \right) \frac{1}{R}$    (4.40)

Figure 4.25 Conversion of a galvanometer (G) to an ammeter by the introduction of a shunt
resistance $r_s$ of very small value in parallel.

Figure 4.26 Conversion of a galvanometer (G) to a voltmeter by the introduction of a
resistance R of large value in series.


An interesting point to note is that increasing the current sensitivity may
not necessarily increase the voltage sensitivity. Let us take Eq. (4.39)
which provides a measure of current sensitivity. If $N \rightarrow 2N$, i.e.,
we double the number of turns, then

$\frac{\phi}{I} \rightarrow 2 \frac{\phi}{I}$

Thus, the current sensitivity doubles. However, the resistance of the
galvanometer is also likely to double, since it is proportional to the length
of the wire. In Eq. (4.40), $N \rightarrow 2N$, and $R \rightarrow 2R$, thus
the voltage sensitivity,

$\frac{\phi}{V} \rightarrow \frac{\phi}{V}$

remains unchanged. So in general, the modification needed for
conversion of a galvanometer to an ammeter will be different from what is
needed for converting it into a voltmeter.
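
The scaling argument can be illustrated with numbers. In the sketch below every galvanometer parameter (number of turns, coil area, field, torsional constant, resistance) is an assumed value picked only to show the trend, and the function names are illustrative.

```python
import math


def current_sensitivity(n_turns, area, field, k):
    """Eq. (4.39): deflection per unit current, phi / I = N A B / k."""
    return n_turns * area * field / k


def voltage_sensitivity(n_turns, area, field, k, resistance):
    """Eq. (4.40): deflection per unit voltage, phi / V = (N A B / k) / R."""
    return current_sensitivity(n_turns, area, field, k) / resistance


# Assumed illustrative galvanometer (not data from the text).
N, A, B, K, R = 100, 1.0e-4, 0.5, 1.0e-6, 50.0

s_i = current_sensitivity(N, A, B, K)
s_v = voltage_sensitivity(N, A, B, K, R)

# Doubling the turns doubles the wire length, so R is taken to double as well.
s_i_doubled = current_sensitivity(2 * N, A, B, K)
s_v_doubled = voltage_sensitivity(2 * N, A, B, K, 2 * R)

print(f"current sensitivity: {s_i:.1f} -> {s_i_doubled:.1f} rad/A")
print(f"voltage sensitivity: {s_v:.1f} -> {s_v_doubled:.1f} rad/V")
```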
















Example 4.13 In the circuit (Fig. 4.27) the current is to be
measured. What is the value of the current if the ammeter shown
(a) is a galvanometer with a resistance $R_G$ = 60.00 Ω; (b) is a
galvanometer described in (a) but converted to an ammeter by a
shunt resistance $r_s$ = 0.02 Ω; (c) is an ideal ammeter with zero
resistance?

Figure 4.27 (circuit: a 3.00 V source, a 3.00 Ω resistor and the ammeter A in series)

Solution 


(a) Total resistance in the circuit is,

$R_G + 3 = 63$ Ω. Hence, I = 3/63 = 0.048 A.

(b) Resistance of the galvanometer converted to an ammeter is,

$\frac{R_G r_s}{R_G + r_s} = \frac{60 \, Ω \times 0.02 \, Ω}{(60 + 0.02) \, Ω} \simeq 0.02$ Ω

Total resistance in the circuit is,

0.02 Ω + 3 Ω = 3.02 Ω. Hence, I = 3/3.02 = 0.99 A.

(c) For the ideal ammeter with zero resistance,
I = 3/3 = 1.00 A
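
The three cases differ only in the resistance that the meter adds to the circuit, which the following sketch makes explicit. The helper names are illustrative; the numbers are those of the example.

```python
def parallel(r1, r2):
    """Equivalent resistance of two resistors connected in parallel."""
    return r1 * r2 / (r1 + r2)


def measured_current(emf, r_circuit, r_meter):
    """Current in a series circuit once the meter resistance is included."""
    return emf / (r_circuit + r_meter)


EMF = 3.00        # V
R_CIRCUIT = 3.00  # ohm
R_G = 60.00       # ohm, galvanometer resistance
R_SHUNT = 0.02    # ohm, shunt resistance

i_a = measured_current(EMF, R_CIRCUIT, R_G)                     # bare galvanometer
i_b = measured_current(EMF, R_CIRCUIT, parallel(R_G, R_SHUNT))  # shunted ammeter
i_c = measured_current(EMF, R_CIRCUIT, 0.0)                     # ideal ammeter

print(f"(a) {i_a:.3f} A")
print(f"(b) {i_b:.2f} A")
print(f"(c) {i_c:.2f} A")
```

The bare galvanometer changes the current drastically, while the shunted instrument reads within about one per cent of the ideal value.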












Summary 

1. The total force on a charge q moving with velocity v in the
presence of magnetic and electric fields B and E, respectively, is
called the Lorentz force. It is given by the expression:

$\mathbf{F} = q (\mathbf{v} \times \mathbf{B} + \mathbf{E})$

The magnetic force $q (\mathbf{v} \times \mathbf{B})$ is normal to v and work done by it is
zero.

2. A straight conductor of length l and carrying a steady
current I experiences a force F in a uniform external magnetic field B,

$\mathbf{F} = I \mathbf{l} \times \mathbf{B}$

where $|\mathbf{l}| = l$ and the direction of l is given by the direction of the
current.

3. In a uniform magnetic field B, a charge q executes a circular orbit 
in a plane normal to B. Its frequency of uniform circular motion is 
called the cyclotron frequency and is given by: 

$\nu_c = \frac{q B}{2\pi m}$

This frequency is independent of the particle’s speed and radius. This 
fact is exploited in a machine, the cyclotron, which is used to 
accelerate charged particles. 

4. The Biot-Savart law asserts that the magnetic field dB due to an 
element dl carrying a steady current I at a point P at a distance r from 
the current element is: 




$d\mathbf{B} = \frac{\mu_0}{4\pi} I \frac{d\mathbf{l} \times \mathbf{r}}{r^3}$

To obtain the total field at P, we must integrate this vector expression 
over the entire length of the conductor. 

5. The magnitude of the magnetic field due to a circular coil of 
radius R carrying a current I at an axial distance x from the centre is 

B = μ0 I R² / [2 (x² + R²)^(3/2)] 

At the centre this reduces to 

B = μ0 I / (2R) 

6. Ampere’s Circuital Law: Let an open surface S be bounded by a 
loop C. Then Ampere’s law states that 

∮ B·dl = μ0 I (integral taken around the loop C) 

where I refers to the current passing through S. The sign of I is 
determined from the right-hand rule. We have discussed a simplified 
form of this law. If B is directed along the tangent to every point on 
the perimeter L of a closed curve and is constant in magnitude along 
the perimeter then, 

BL = μ0 Ie 

where Ie is the net current enclosed by the closed circuit. 

7. The magnitude of the magnetic field at a distance R from a long, 
straight wire carrying a current I is given by: 

B = μ0 I / (2πR) 

The field lines are circles concentric with the wire. 




8. The magnitude of the field B inside a long solenoid carrying a 
current I is 

B = μ0 n I 

where n is the number of turns per unit length. For a toroid one 
obtains, 

B = μ0 N I / (2πr) 

where N is the total number of turns and r is the average radius. 

9. Parallel currents attract and anti-parallel currents repel. 

10. A planar loop carrying a current I, having N closely wound turns, 
and an area A possesses a magnetic moment m where, 

m = N I A 

and the direction of m is given by the right-hand thumb rule : curl the 
palm of your right hand along the loop with the fingers pointing in the 
direction of the current. The thumb sticking out gives the direction 
of m (and A). 

When this loop is placed in a uniform magnetic field B, the force F on 
it is: F = 0 

And the torque on it is, 
τ = m × B 

In a moving coil galvanometer, this torque is balanced by a 
counter-torque due to a spring, yielding 

kφ = N I A B 

where φ is the equilibrium deflection and k the torsion constant of the 
spring. 




11. An electron moving around the central nucleus has a magnetic 
moment μl given by: 

μl = (e / 2me) l 

where l is the magnitude of the angular momentum of the circulating 
electron about the central nucleus and me is the electron mass. The 
smallest value of μl is called the Bohr magneton μB and it is 
μB = 9.27 × 10⁻²⁴ J/T 

12. A moving coil galvanometer can be converted into an ammeter by 
introducing a shunt resistance rs of small value in parallel. It can be 
converted into a voltmeter by introducing a resistance of a large value 
in series. 
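
Item 3 of the summary states that the cyclotron frequency does not depend on the speed or the orbit radius. A minimal numerical sketch of that statement follows. The proton values and the 1.0 T field are assumed purely for illustration, the treatment is non-relativistic, and the function names are hypothetical.

```python
import math


def cyclotron_frequency(q, b, m):
    """Cyclotron frequency in Hz: nu = |q| B / (2 pi m)."""
    return abs(q) * b / (2 * math.pi * m)


def orbit_radius(q, b, m, v):
    """Orbit radius in m for speed v normal to B: r = m v / (|q| B)."""
    return m * v / (abs(q) * b)


# Assumed illustrative values: a proton in a 1.0 T field
Q_P = 1.6e-19    # C
M_P = 1.67e-27   # kg
B = 1.0          # T

nu = cyclotron_frequency(Q_P, B, M_P)
frequencies = []
for v in (1.0e5, 1.0e6, 1.0e7):  # m/s, all well below the speed of light
    r = orbit_radius(Q_P, B, M_P, v)
    # Revolutions per second from the orbit geometry: speed / circumference
    frequencies.append(v / (2 * math.pi * r))
    print(f"v = {v:.1e} m/s  r = {r:.3e} m  nu = {frequencies[-1]:.3e} Hz")
```

A faster particle moves on a proportionally larger circle, so the time for one revolution is unchanged. This is the property the cyclotron relies on.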
Points to Ponder 

1. Electrostatic field lines originate at a positive charge and terminate 
at a negative charge or fade at infinity. Magnetic field lines always 
form closed loops. 

2. The discussion in this Chapter holds only for steady currents which 
do not vary with time. 

When currents vary with time Newton’s third law is valid only if 
momentum carried by the electromagnetic field is taken into account. 

3. Recall the expression for the Lorentz force, 

F = q (v x B + E) 

This velocity dependent force has occupied the attention of some of 
the greatest scientific thinkers. If one switches to a frame with 
instantaneous velocity v, the magnetic part of the force vanishes. The 
motion of the charged particle is then explained by arguing that there 
exists an appropriate electric field in the new frame. We shall not 
discuss the details of this mechanism. However, we stress that the 
resolution of this paradox implies that electricity and magnetism are 
linked phenomena (electromagnetism) and that the Lorentz force 
expression does not imply a universal preferred frame of reference in 
nature. 

4. Ampere’s Circuital law is not independent of the Biot-Savart law. It 
can be derived from the Biot-Savart law. Its relationship to the 
Biot-Savart law is similar to the relationship between Gauss’s law and 
Coulomb’s law. 


Exercises 

4.1 A circular coil of wire consisting of 100 turns, each of radius 8.0 
cm carries a current of 0.40 A. What is the magnitude of the magnetic 
field B at the centre of the coil? 

4.2 A long straight wire carries a current of 35 A. What is the 
magnitude of the field B at a point 20 cm from the wire? 

4.3 A long straight wire in the horizontal plane carries a current of 50 
A in north to south direction. Give the magnitude and direction of B at 
a point 2.5 m east of the wire. 

4.4 A horizontal overhead power line carries a current of 90 A in east 
to west direction. What is the magnitude and direction of the 
magnetic field due to the current 1.5 m below the line? 

4.5 What is the magnitude of magnetic force per unit length on a wire 
carrying a current of 8 A and making an angle of 30° with the direction 
of a uniform magnetic field of 0.15 T? 

4.6 A 3.0 cm wire carrying a current of 10 A is placed inside a 
solenoid perpendicular to its axis. The magnetic field inside the 
solenoid is given to be 0.27 T. What is the magnetic force on the 
wire? 

4.7 Two long and parallel straight wires A and B carrying currents of 
8.0 A and 5.0 A in the same direction are separated by a distance of 
4.0 cm. Estimate the force on a 10 cm section of wire A. 

4.8 A closely wound solenoid 80 cm long has 5 layers of windings of 
400 turns each. The diameter of the solenoid is 1.8 cm. If the current 
carried is 8.0 A, estimate the magnitude of B inside the solenoid near 
its centre. 

4.9 A square coil of side 10 cm consists of 20 turns and carries a 
current of 12 A. The coil is suspended vertically and the normal to the 
plane of the coil makes an angle of 30° with the direction of a uniform 
horizontal magnetic field of magnitude 0.80 T. What is the magnitude 
of torque experienced by the coil? 

4.10 Two moving coil meters, M1 and M2 have the following 
particulars: 

R1 = 10 Ω, N1 = 30, 
A1 = 3.6 × 10⁻³ m², B1 = 0.25 T 

R2 = 14 Ω, N2 = 42, 
A2 = 1.8 × 10⁻³ m², B2 = 0.50 T 

(The spring constants are identical for the two meters). 

Determine the ratio of (a) current sensitivity and (b) voltage sensitivity 
of M2 and M1. 

4.11 In a chamber, a uniform magnetic field of 6.5 G (1 G = 10⁻⁴ T) is 
maintained. An electron is shot into the field with a speed of 
4.8 × 10⁶ m s⁻¹ normal to the field. Explain why the path of the 
electron is a circle. Determine the radius of the circular orbit. 

(e = 1.6 × 10⁻¹⁹ C, me = 9.1 × 10⁻³¹ kg) 

4.12 In Exercise 4.11 obtain the frequency of revolution of the 
electron in its circular orbit. Does the answer depend on the speed of 
the electron? Explain. 

4.13 (a) A circular coil of 30 turns and radius 8.0 cm carrying a 
current of 6.0 A is suspended vertically in a uniform horizontal 
magnetic field of magnitude 1.0 T. The field lines make an angle of 
60° with the normal of the coil. Calculate the magnitude of the 
counter torque that must be applied to prevent the coil from turning. 

(b) Would your answer change, if the circular coil in (a) were replaced 
by a planar coil of some irregular shape that encloses the same 
area? (All other particulars are also unaltered.) 


Additional Exercises 

4.14 Two concentric circular coils X and Y of radii 16 cm and 10 cm, 
respectively, lie in the same vertical plane containing the north to 
south direction. Coil X has 20 turns and carries a current of 16 A; coil 
Y has 25 turns and carries a current of 18 A. The sense of the current 
in X is anticlockwise, and clockwise in Y, for an observer looking at 
the coils facing west. Give the magnitude and direction of the net 
magnetic field due to the coils at their centre. 

4.15 A magnetic field of 100 G (1 G = 10⁻⁴ T) is required which is 
uniform in a region of linear dimension about 10 cm and area of 
cross-section about 10⁻³ m². The maximum current-carrying 
capacity of a given coil of wire is 15 A and the number of turns per 
unit length that can be wound round a core is at most 1000 turns m⁻¹. 
Suggest some appropriate design particulars of a solenoid for the 
required purpose. Assume the core is not ferromagnetic. 

4.16 For a circular coil of radius R and N turns carrying current I, the 
magnitude of the magnetic field at a point on its axis at a distance x 
from its centre is given by, 

B = μ0 I R² N / [2 (x² + R²)^(3/2)] 

(a) Show that this reduces to the familiar result for field at the centre 
of the coil. 


(b) Consider two parallel co-axial circular coils of equal radius R, and 
number of turns N, carrying equal currents in the same direction, and 
separated by a distance R. Show that the field on the axis around the 
mid-point between the coils is uniform over a distance that is small as 
compared to R, and is given by, 


B = 0.72 μ0 N I / R, approximately. 




[Such an arrangement to produce a nearly uniform magnetic field 
over a small region is known as Helmholtz coils.] 

4.17 A toroid has a core (non-ferromagnetic) of inner radius 25 cm 
and outer radius 26 cm, around which 3500 turns of a wire are 
wound. If the current in the wire is 11 A, what is the magnetic field 

(a) outside the toroid, (b) inside the core of the toroid, and (c) in the 
empty space surrounded by the toroid. 

4.18 Answer the following questions: 

(a) A magnetic field that varies in magnitude from point to point but 
has a constant direction (east to west) is set up in a chamber. A 
charged particle enters the chamber and travels undeflected along a 
straight path with constant speed. What can you say about the initial 
velocity of the particle? 

(b) A charged particle enters an environment of a strong and 
non-uniform magnetic field varying from point to point both in 
magnitude and direction, and comes out of it following a complicated 
trajectory. Would its final speed equal the initial speed if it suffered no 
collisions with the environment? 

(c) An electron travelling west to east enters a chamber having a 
uniform electrostatic field in north to south direction. Specify the 
direction in which a uniform magnetic field should be set up to 
prevent the electron from deflecting from its straight line path. 

4.19 An electron emitted by a heated cathode and accelerated 
through a potential difference of 2.0 kV, enters a region with uniform 
magnetic field of 0.15 T. Determine the trajectory of the electron if the 
field 

(a) is transverse to its initial velocity, (b) makes an angle of 30° with 
the initial velocity. 




4.20 A magnetic field set up using Helmholtz coils (described in 
Exercise 4.16) is uniform in a small region and has a magnitude of 
0.75 T. In the same region, a uniform electrostatic field is maintained 
in a direction normal to the common axis of the coils. A narrow beam 
of (single species) charged particles all accelerated through 15 kV 
enters this region in a direction perpendicular to both the axis of the 
coils and the electrostatic field. If the beam remains undeflected 
when the electrostatic field is 9.0 × 10⁵ V m⁻¹, make a simple 
guess as to what the beam contains. Why is the answer not unique? 

4.21 A straight horizontal conducting rod of length 0.45 m and mass 
60 g is suspended by two vertical wires at its ends. A current of 5.0 A 
is set up in the rod through the wires. 

(a) What magnetic field should be set up normal to the conductor in 
order that the tension in the wires is zero? 

(b) What will be the total tension in the wires if the direction of current 
is reversed keeping the magnetic field same as before? (Ignore the 
mass of the wires.) g = 9.8 m s⁻². 

4.22 The wires which connect the battery of an automobile to its 
starting motor carry a current of 300 A (for a short time). What is the 
force per unit length between the wires if they are 70 cm long and 1.5 
cm apart? Is the force attractive or repulsive? 

4.23 A uniform magnetic field of 1.5 T exists in a cylindrical region of 
radius 10.0 cm, its direction parallel to the axis along east to west. A 
wire carrying current of 7.0 A in the north to south direction passes 
through this region. What is the magnitude and direction of the force 
on the wire if, 

(a) the wire intersects the axis, 




(b) the wire is turned from N-S to northeast-northwest direction, 

(c) the wire in the N-S direction is lowered from the axis by a distance 
of 6.0 cm? 

4.24 A uniform magnetic field of 3000 G is established along the 
positive z-direction. A rectangular loop of sides 10 cm and 5 cm 
carries a current of 12 A. What is the torque on the loop in the 
different cases shown in Fig. 4.28? What is the force on each case? 
Which case corresponds to stable equilibrium? 



Figure 4.28 

4.25 A circular coil of 20 turns and radius 10 cm is placed in a uniform 
magnetic field of 0.10 T normal to the plane of the coil. If the current 
in the coil is 5.0 A, what is the 

(a) total torque on the coil, 

(b) total force on the coil, 













(c) average force on each electron in the coil due to the magnetic 
field? 

(The coil is made of copper wire of cross-sectional area 10⁻⁵ m², 
and the free electron density in copper is given to be about 
10²⁹ m⁻³.) 

4.26 A solenoid 60 cm long and of radius 4.0 cm has 3 layers of 
windings of 300 turns each. A 2.0 cm long wire of mass 2.5 g lies 
inside the solenoid (near its centre) normal to its axis; both the wire 
and the axis of the solenoid are in the horizontal plane. The wire is 
connected through two leads parallel to the axis of the solenoid to an 
external battery which supplies a current of 6.0 A in the wire. What 
value of current (with appropriate sense of circulation) in the windings 
of the solenoid can support the weight of the wire? g = 9.8 m s⁻². 

4.27 A galvanometer coil has a resistance of 12 Ω and the meter 
shows full scale deflection for a current of 3 mA. How will you convert 
the meter into a voltmeter of range 0 to 18 V? 

4.28 A galvanometer coil has a resistance of 15 Ω and the meter 
shows full scale deflection for a current of 4 mA. How will you convert 
the meter into an ammeter of range 0 to 6 A? 

Chapter Five 


Magnetism and Matter 


5.1 Introduction 

Magnetic phenomena are universal in nature. Vast, distant galaxies, the 
tiny invisible atoms, humans and beasts all are permeated through and 
through with a host of magnetic fields from a variety of sources. The 
earth’s magnetism predates human evolution. The word magnet is 
derived from the name of an island in Greece called Magnesia where 
magnetic ore deposits were found, as early as 600 BC. Shepherds on 
this island complained that their wooden shoes (which had nails) at times 
stayed stuck to the ground. Their iron-tipped rods were similarly 
affected. This attractive property of magnets made it difficult for them to 
move around. 

The directional property of magnets was also known since ancient times. 
A thin long piece of a magnet, when suspended freely, pointed in the 
north-south direction. A similar effect was observed when it was placed 
on a piece of cork which was then allowed to float in still water. The name 
lodestone (or loadstone) given to a naturally occurring ore of iron, 
magnetite, means leading stone. The technological exploitation of this 
property is generally credited to the Chinese. Chinese texts dating 400 
BC mention the use of magnetic needles for navigation on ships. 
Caravans crossing the Gobi desert also employed magnetic needles. 

A Chinese legend narrates the tale of the victory of the emperor Huang-ti 
about four thousand years ago, which he owed to his craftsmen (whom 
nowadays you would call engineers). These ‘engineers’ built a chariot on 
which they placed a magnetic figure with arms outstretched. Figure 5.1 is 
an artist’s description of this chariot. The figure swiveled around so that 
the finger of the statuette on it always pointed south. With this chariot, 
Huang-ti’s troops were able to attack the enemy from the rear in thick fog, 
and to defeat them. 



Figure 5.1 The arm of the statuette mounted on the chariot always points south. This is an artist’s 
sketch of one of the earliest known compasses, thousands of years old. 

In the previous chapter we have learned that moving charges or electric 
currents produce magnetic fields. This discovery, which was made in the 
early part of the nineteenth century is credited to Oersted, Ampere, Biot 
and Savart, among others. In the present chapter, we take a look at 
magnetism as a subject in its own right. 

Some of the commonly known ideas regarding magnetism are: 

(i) The earth behaves as a magnet with the magnetic field pointing 
approximately from the geographic south to the north. 














(ii) When a bar magnet is freely suspended, it points in the north-south 
direction. The tip which points to the geographic north is called the north 
pole and the tip which points to the geographic south is called the south 
pole of the magnet. 

(iii) There is a repulsive force when north poles (or south poles) of two 
magnets are brought close together. Conversely, there is an attractive 
force between the north pole of one magnet and the south pole of 
the other. 

(iv) We cannot isolate the north, or south pole of a magnet. If a bar 
magnet is broken into two halves, we get two similar bar magnets with 
somewhat weaker properties. Unlike electric charges, isolated magnetic 
north and south poles known as magnetic monopoles do not exist. 

(v) It is possible to make magnets out of iron and its alloys. 

We begin with a description of a bar magnet and its behaviour in an 
external magnetic field. We describe Gauss’s law of magnetism. We then 
follow it up with an account of the earth’s magnetic field. We next 
describe how materials can be classified on the basis of their magnetic 
properties. We describe para-, dia-, and ferromagnetism. We conclude 
with a section on electromagnets and permanent magnets. 

5.2 The Bar Magnet 

One of the earliest childhood memories of the famous physicist Albert 
Einstein was that of a magnet gifted to him by a relative. Einstein was 
fascinated, and played endlessly with it. He wondered how the magnet 
could affect objects such as nails or pins placed away from it and not in 
any way connected to it by a spring or string. 

We begin our study by examining iron filings sprinkled on a sheet of glass 
placed over a short bar magnet. The arrangement of iron filings is shown 
in Fig. 5.2. 

The pattern of iron filings suggests that the magnet has two poles similar 
to the positive and negative charge of an electric dipole. As mentioned in 
the introductory section, one pole is designated the North pole and the 
other, the South pole. When suspended freely, these poles point 
approximately towards the geographic north and south poles, 
respectively. A similar pattern of iron filings is observed around a current 
carrying solenoid. 



Figure 5.2 The arrangement of iron filings surrounding a bar magnet. The pattern mimics 
magnetic field lines. The pattern suggests that the bar magnet is a magnetic dipole. 


5.2.1 The magnetic field lines 

The pattern of iron filings permits us to plot the magnetic field lines*. This 
is shown both for the bar-magnet and the current-carrying solenoid in 
Fig. 5.3. For comparison refer to Chapter 1, Figure 1.17(d). Electric 
field lines of an electric dipole are also displayed in Fig. 5.3(c). The 
magnetic field lines are a visual and intuitive realisation of the magnetic 
field. Their properties are: 



(i) The magnetic field lines of a magnet (or a solenoid) form continuous 
closed loops. This is unlike the electric dipole where these field lines 
begin from a positive charge and end on the negative charge or escape 
to infinity. 

(ii) The tangent to the field line at a given point represents the direction of 
the net magnetic field B at that point. 





Figure 5.3 The field lines of (a) a bar magnet, (b) a current-carrying finite solenoid and (c) electric 
dipole. At large distances, the field lines are very similar. The curves labelled i and ii are closed 
Gaussian surfaces. 

* In some textbooks the magnetic field lines are called magnetic lines of 
force. This nomenclature is avoided since it can be confusing. Unlike 
electrostatics the field lines in magnetism do not indicate the direction of 
the force on a (moving) charge. 

(iii) The larger the number of field lines crossing per unit area, the 
stronger is the magnitude of the magnetic field B. In Fig. 5.3(a), B is 
larger around region ii than in region i. 

(iv) The magnetic field lines do not intersect, for if they did, the direction 
of the magnetic field would not be unique at the point of intersection. 

One can plot the magnetic field lines in a variety of ways. One way is to 
place a small magnetic compass needle at various positions and note its 
orientation. This gives us an idea of the magnetic field direction at various 
points in space. 

5.2.2 Bar magnet as an equivalent solenoid 

In the previous chapter, we have explained how a current loop acts as a 
magnetic dipole (Section 4.10). We mentioned Ampere’s hypothesis that 
all magnetic phenomena can be explained in terms of circulating 
currents. Recall that the magnetic dipole moment m associated with a 
current loop was defined to be m = NIA where N is the number of turns in 
the loop, I the current and A the area vector (Eq. 4.30). 






Figure 5.4 Calculation of (a) The axial field of a finite solenoid in order to demonstrate its similarity 
to that of a bar magnet, (b) A magnetic needle in a uniform magnetic field B. The arrangement 
may be used to determine either B or the magnetic moment m of the needle. 

The resemblance of magnetic field lines for a bar magnet and a solenoid 
suggests that a bar magnet may be thought of as a large number of 
circulating currents in analogy with a solenoid. Cutting a bar magnet in 
half is like cutting a solenoid. We get two smaller solenoids with weaker 
magnetic properties. The field lines remain continuous, emerging from 
one face of the solenoid and entering into the other face. One can test 
this analogy by moving a small compass needle in the neighbourhood of 
a bar magnet and a current-carrying finite solenoid and noting that the 
deflections of the needle are similar in both cases. 




















To make this analogy more firm we calculate the axial field of a finite 
solenoid depicted in Fig. 5.4 (a). We shall demonstrate that at large 
distances this axial field resembles that of a bar magnet. 

Let the solenoid of Fig. 5.4(a) consist of n turns per unit length. Let its 
length be 2l and radius a. We can evaluate the axial field at a point P, at a 
distance r from the centre O of the solenoid. To do this, consider a 
circular element of thickness dx of the solenoid at a distance x from its 
centre. It consists of n dx turns. Let I be the current in the solenoid. In 
Section 4.6 of the previous chapter we have calculated the magnetic field 
on the axis of a circular current loop. From Eq. (4.13), the magnitude of 
the field at point P due to the circular element is 

dB = μ0 n dx I a² / (2 [(r − x)² + a²]^(3/2)) 


The magnitude of the total field is obtained by summing over all the 
elements, in other words by integrating from x = −l to x = +l. Thus, 

B = (μ0 n I a² / 2) ∫ dx / [(r − x)² + a²]^(3/2), integrated from x = −l to x = +l 


This integration can be done by trigonometric substitutions. This 
exercise, however, is not necessary for our purpose. Note that the range 
of x is from −l to +l. Consider the far axial field of the solenoid, i.e., 
r >> a and r >> l. Then the denominator is approximated by 

[(r − x)² + a²]^(3/2) ≈ r³ 

and 

B = (μ0 n I a² / 2r³) ∫ dx, integrated from x = −l to x = +l 

  = (μ0 n I / 2) (2l a² / r³) (5.1) 



Note that the magnitude of the magnetic moment of the solenoid is, 
m = n (2l) I (πa²) (total number of turns × current × cross-sectional area). 
Thus, 

B = (μ0 / 4π) (2m / r³) (5.2) 


This is also the far axial magnetic field of a bar magnet which one may 
obtain experimentally. Thus, a bar magnet and a solenoid produce similar 
magnetic fields. The magnetic moment of a bar magnet is thus equal to 
the magnetic moment of an equivalent solenoid that produces the same 
magnetic field. 
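
The approach to the dipole form can also be seen numerically. The sketch below integrates the expression for dB over the length of the solenoid with a simple midpoint rule and compares the result with Eq. (5.2). The solenoid dimensions and current are assumed values chosen only for illustration, and the function names are hypothetical.

```python
import math

MU0 = 4 * math.pi * 1e-7  # T m / A


def solenoid_axial_field(n, current, a, half_len, r, steps=2000):
    """Axial field at distance r from the centre, by midpoint-rule
    integration of dB = mu0 n I a^2 dx / (2 [(r - x)^2 + a^2]^(3/2))
    over x from -half_len to +half_len."""
    dx = 2 * half_len / steps
    total = 0.0
    for k in range(steps):
        x = -half_len + (k + 0.5) * dx
        total += dx / ((r - x) ** 2 + a ** 2) ** 1.5
    return MU0 * n * current * a ** 2 / 2 * total


def dipole_axial_field(m, r):
    """Far axial field of a dipole, Eq. (5.2): B = (mu0 / 4 pi) 2 m / r^3."""
    return MU0 / (4 * math.pi) * 2 * m / r ** 3


# Assumed illustrative values (not from the text)
n, current, a, half_len = 1000.0, 2.0, 0.01, 0.05   # turns/m, A, m, m
m = n * (2 * half_len) * current * math.pi * a ** 2  # magnetic moment, A m^2

ratios = {}
for r in (0.2, 0.5, 2.0):
    exact = solenoid_axial_field(n, current, a, half_len, r)
    approx = dipole_axial_field(m, r)
    ratios[r] = exact / approx
    print(f"r = {r} m: integral = {exact:.3e} T, dipole = {approx:.3e} T, "
          f"ratio = {ratios[r]:.4f}")
```

The ratio of the two results moves towards 1 as r grows, which is the content of the approximation r >> a and r >> l.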

Some textbooks assign a magnetic charge (also called pole strength) 
+qm to the north pole and −qm to the south pole of a bar magnet of length 
2l, and magnetic moment qm(2l). The field strength due to qm at a 
distance r from it is given by μ0 qm / 4πr². The magnetic field due to the bar 
magnet is then obtained, both for the axial and the equatorial case, in a 
manner analogous to that of an electric dipole (Chapter 1). The method is 
simple and appealing. However, magnetic monopoles do not exist, and 
we have avoided this approach for that reason. 


5.2.3 The dipole in a uniform magnetic field 

The pattern of iron filings, i.e., the magnetic field lines gives us an 
approximate idea of the magnetic field B. We may at times be required to 
determine the magnitude of B accurately. This is done by placing a small 
compass needle of known magnetic moment m and moment of inertia I 
and allowing it to oscillate in the magnetic field. This arrangement is 
shown in Fig. 5.4(b). 

The torque on the needle is [see Eq. (4.29)], 



τ = m × B (5.3) 

In magnitude τ = mB sin θ 

Here τ is restoring torque and θ is the angle between m and B. 

Therefore, in equilibrium 

I d²θ/dt² = −mB sin θ 

Negative sign with mB sin θ implies that restoring torque is in opposition 
to deflecting torque. For small values of θ in radians, we approximate 
sin θ ≈ θ and get 

I d²θ/dt² ≈ −mBθ 

or, 

d²θ/dt² = −(mB / I) θ 

This represents a simple harmonic motion. The square of the angular 
frequency is ω² = mB/I and the time period is, 

T = 2π √[I / (mB)] (5.4) 

or B = 4π² I / (m T²) (5.5) 


An expression for magnetic potential energy can also be obtained on 
lines similar to electrostatic potential energy. 

The magnetic potential energy Um is given by 

Um = ∫ τ(θ) dθ 

   = ∫ mB sin θ dθ = −mB cos θ 

   = −m·B (5.6) 


We have emphasised in Chapter 2 that the zero of potential energy can 
be fixed at one’s convenience. Taking the constant of integration to be 
zero means fixing the zero of potential energy at θ = 90°, i.e., when the 
needle is perpendicular to the field. Equation (5.6) shows that potential 
energy is minimum (= −mB) at θ = 0° (most stable position) and 
maximum (= +mB) at θ = 180° (most unstable position). 


Example 5.1 In Fig. 5.4(b), the magnetic needle has magnetic 
moment 6.7 × 10⁻² A m² and moment of inertia I = 7.5 × 10⁻⁶ kg m². It 
performs 10 complete oscillations in 6.70 s. What is the magnitude of 
the magnetic field? 

Solution The time period of oscillation is, 

T = 6.70/10 = 0.67 s 

From Eq. (5.5) 

B = 4π² I / (m T²) 

  = 4 × (3.14)² × 7.5 × 10⁻⁶ / [6.7 × 10⁻² × (0.67)²] 

  = 0.01 T 
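
The arithmetic of Example 5.1 can be reproduced with a short script. This is a sketch of Eqs. (5.4) and (5.5) only; the function names are hypothetical, and the small-oscillation approximation of the text is assumed.

```python
import math


def field_from_period(moment_of_inertia, magnetic_moment, period):
    """Eq. (5.5): B = 4 pi^2 I / (m T^2), valid for small oscillations."""
    return 4 * math.pi ** 2 * moment_of_inertia / (magnetic_moment * period ** 2)


def period_from_field(moment_of_inertia, magnetic_moment, b):
    """Eq. (5.4): T = 2 pi sqrt(I / (m B))."""
    return 2 * math.pi * math.sqrt(moment_of_inertia / (magnetic_moment * b))


period = 6.70 / 10  # s, ten oscillations in 6.70 s
b = field_from_period(7.5e-6, 6.7e-2, period)
print(f"B = {b:.4f} T")

# Consistency check: Eq. (5.4) with this B should give back the period
print(f"T = {period_from_field(7.5e-6, 6.7e-2, b):.2f} s")
```

The unrounded value is slightly below 0.01 T, which is the figure quoted in the solution to two decimal places.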


Example 5.2 A short bar magnet placed with its axis at 30° with an 
external field of 800 G experiences a torque of 0.016 Nm. (a) What is 
the magnetic moment of the magnet? (b) What is the work done in 
moving it from its most stable to most unstable position? (c) The bar 
magnet is replaced by a solenoid of cross-sectional area 2 × 10⁻⁴ m² 
and 1000 turns, but of the same magnetic moment. Determine the 
current flowing through the solenoid. 

Solution 

(a) From Eq. (5.3), τ = m B sin θ, θ = 30°, hence sin θ = 1/2. 

Thus, 0.016 = m × (800 × 10⁻⁴ T) × (1/2) 

m = 160 × 2/800 = 0.40 A m² 

(b) From Eq. (5.6), the most stable position is θ = 0° and the most 
unstable position is θ = 180°. Work done is given by 

W = Um(θ = 180°) − Um(θ = 0°) 

  = 2 m B = 2 × 0.40 × 800 × 10⁻⁴ = 0.064 J 

(c) From Eq. (4.30), ms = NIA. From part (a), ms = 0.40 A m² 
0.40 = 1000 × I × 2 × 10⁻⁴ 

I = 0.40 × 10⁴/(1000 × 2) = 2 A 
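
The three parts of Example 5.2 follow from τ = mB sin θ, Eq. (5.6) and m = NIA. The sketch below repeats the calculation; the function names are hypothetical and are introduced only for this illustration.

```python
import math


def moment_from_torque(torque, b, angle_deg):
    """Invert tau = m B sin(theta) for the magnetic moment m (A m^2)."""
    return torque / (b * math.sin(math.radians(angle_deg)))


def potential_energy(m, b, angle_deg):
    """Eq. (5.6): U = -m B cos(theta), in joules."""
    return -m * b * math.cos(math.radians(angle_deg))


B_EXT = 800e-4  # T (800 G)

# (a) magnetic moment from the measured torque at 30 degrees
m = moment_from_torque(0.016, B_EXT, 30.0)
# (b) work done from the most stable (0 deg) to the most unstable (180 deg) position
work = potential_energy(m, B_EXT, 180.0) - potential_energy(m, B_EXT, 0.0)
# (c) solenoid current from m = N I A with N = 1000 and A = 2e-4 m^2
current = m / (1000 * 2e-4)

print(f"m = {m:.2f} A m^2, W = {work:.3f} J, I = {current:.1f} A")
```

Writing the work as a difference of potential energies makes it clear that the result 2mB does not depend on where the zero of potential energy is fixed.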


Example 5.3 

(a) What happens if a bar magnet is cut into two pieces: (i) transverse 
to its length, (ii) along its length? 

(b) A magnetised needle in a uniform magnetic field experiences a 
torque but no net force. An iron nail near a bar magnet, however, 
experiences a force of attraction in addition to a torque. Why? 






(c) Must every magnetic configuration have a north pole and a south 
pole? What about the field due to a toroid? 

(d) Two identical looking iron bars A and B are given, one of which is 
definitely known to be magnetised. (We do not know which one.) How 
would one ascertain whether or not both are magnetised? If only one 
is magnetised, how does one ascertain which one? [Use nothing else 
but the bars A and B.] 

Solution 

(a) In either case, one gets two magnets, each with a north and south 
pole. 

(b) No force if the field is uniform. The iron nail experiences a 
non-uniform field due to the bar magnet. There is an induced magnetic 
moment in the nail; therefore, it experiences both force and torque. 
The net force is attractive because the induced south pole (say) in 
the nail is closer to the north pole of magnet than induced north pole. 

(c) Not necessarily. True only if the source of the field has a net 
non-zero magnetic moment. This is not so for a toroid or even for a 
straight infinite conductor. 

(d) Try to bring different ends of the bars closer. A repulsive force in 
some situation establishes that both are magnetised. If it is always 
attractive, then one of them is not magnetised. In a bar magnet the 
intensity of the magnetic field is the strongest at the two ends (poles) 
and weakest at the central region. This fact may be used to 
determine whether A or B is the magnet. In this case, to see which 
one of the two bars is a magnet, pick up one (say, A) and lower one 
of its ends; first on one of the ends of the other (say, B), and then on 
the middle of B. If you notice that in the middle of B, A experiences no 
force, then B is magnetised. If you do not notice any change from the 
end to the middle of B, then A is magnetised. 


5.2.4 The electrostatic analog 

Comparison of Eqs. (5.2), (5.3) and (5.6) with the corresponding 
equations for electric dipole (Chapter 1), suggests that magnetic field at 
large distances due to a bar magnet of magnetic moment m can be 
obtained from the equation for electric field due to an electric dipole of 
dipole moment p, by making the following replacements: 


E → B, p → m, 1/(4πε0) → μ0/(4π) 


In particular, we can write down the equatorial field (BE) of a bar magnet 
at a distance r, for r >> l, where l is the size of the magnet: 

BE = −μ0 m / (4π r³) (5.7) 


Likewise, the axial field (BA) of a bar magnet for r >> l is: 

BA = (μ0 / 4π) (2m / r³) (5.8) 


Equation (5.8) is just Eq. (5.2) in the vector form. Table 5.1 summarises 
the analogy between electric and magnetic dipoles. 








Table 5.1 The dipole analogy 

                                      Electrostatics     Magnetism 
Dipole moment                         p                  m 
Equatorial field for a short dipole   −p/4πε0r³          −μ0 m/4πr³ 
Axial field for a short dipole        2p/4πε0r³          μ0 2m/4πr³ 
External field: torque                p × E              m × B 
External field: energy                −p·E               −m·B 


Example 5.4 What is the magnitude of the equatorial and axial fields 
due to a bar magnet of length 5.0 cm at a distance of 50 cm from its 
mid-point? The magnetic moment of the bar magnet is 0.40 A m², the 
same as in Example 5.2. 

Solution From Eq. (5.7) 

BE = μ0 m / (4π r³) = 10⁻⁷ × 0.4 / (0.5)³ = 10⁻⁷ × 0.4 / 0.125 = 3.2 × 10⁻⁷ T 

From Eq. (5.8), BA = μ0 2m / (4π r³) = 6.4 × 10⁻⁷ T 
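
The two numbers in Example 5.4 can be verified with the magnitudes of Eqs. (5.7) and (5.8). The sketch below assumes the short-dipole limit (5.0 cm length against 50 cm distance) used in the text; the function names are hypothetical.

```python
MU0_OVER_4PI = 1e-7  # T m / A


def equatorial_field(m, r):
    """Magnitude of Eq. (5.7): B_E = (mu0 / 4 pi) m / r^3."""
    return MU0_OVER_4PI * m / r ** 3


def axial_field(m, r):
    """Magnitude of Eq. (5.8): B_A = (mu0 / 4 pi) 2 m / r^3."""
    return MU0_OVER_4PI * 2 * m / r ** 3


b_e = equatorial_field(0.40, 0.50)  # m in A m^2, r in m
b_a = axial_field(0.40, 0.50)
print(f"B_E = {b_e:.1e} T, B_A = {b_a:.1e} T")
```

At the same distance the axial field is exactly twice the equatorial field, as in the electric dipole case.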


Example 5.5 Figure 5.5 shows a small magnetised needle P placed 
at a point O. The arrow shows the direction of its magnetic moment. 
The other arrows show different positions (and orientations of the 
magnetic moment) of another identical magnetised needle Q. 

(a) In which configuration is the system not in equilibrium? 

(b) In which configuration is the system in (i) stable, and (ii) unstable 
equilibrium? 

(c) Which configuration corresponds to the lowest potential energy 
among all the configurations shown? 

Figure 5.5 

Solution 

Potential energy of the configuration arises due to the potential
energy of one dipole (say, Q) in the magnetic field due to the other (P).
Use the result that the field due to P is given by the expressions [Eqs.
(5.7) and (5.8)]:

$\mathbf{B}_P = -\frac{\mu_0}{4\pi}\frac{\mathbf{m}_P}{r^3}$ (on the normal bisector)

$\mathbf{B}_P = \frac{\mu_0}{4\pi}\frac{2\mathbf{m}_P}{r^3}$ (on the axis)

where $\mathbf{m}_P$ is the magnetic moment of the dipole P.

Equilibrium is stable when $\mathbf{m}_Q$ is parallel to $\mathbf{B}_P$, and unstable when it
is anti-parallel to $\mathbf{B}_P$.

For instance for the configuration Q3 for which Q is along the 
perpendicular bisector of the dipole P, the magnetic moment of Q is 
parallel to the magnetic field at the position 3. Hence Q3 is stable. 

Thus, 






(a) PQ1 and PQ2 

(b) (i) PQ3, PQ6 (stable); (ii) PQ5, PQ4 (unstable) 

(c) PQ6 


5.3 Magnetism and Gauss’s Law 

In Chapter 1, we studied Gauss’s law for electrostatics. In Fig 5.3(c), we 
see that for a closed surface represented by i , the number of lines 
leaving the surface is equal to the number of lines entering it. This is 
consistent with the fact that no net charge is enclosed by the surface. 
However, in the same figure, for the closed surface ii , there is a net 
outward flux, since it does include a net (positive) charge. 

The situation is radically different for magnetic fields which are 
continuous and form closed loops. Examine the Gaussian surfaces 
represented by i or ii in Fig 5.3(a) or Fig. 5.3(b). Both cases visually 
demonstrate that the number of magnetic field lines leaving the surface is 
balanced by the number of lines entering it. The net magnetic flux is zero 
for both the surfaces. This is true for any closed surface. 



Karl Friedrich Gauss (1777-1855) He was a child prodigy and was
gifted in mathematics, physics, engineering, astronomy and even
land surveying. The properties of numbers fascinated him, and in his
work he anticipated major mathematical developments of later times.
Along with Wilhelm Weber, he built the first electric telegraph in 1833.
His mathematical theory of curved surfaces laid the foundation for the
later work of Riemann.



Figure 5.6 

Consider a small vector area element $\Delta\mathbf{S}$ of a closed surface S as in Fig.
5.6. The magnetic flux through $\Delta\mathbf{S}$ is defined as $\Delta\phi_B = \mathbf{B}\cdot\Delta\mathbf{S}$, where $\mathbf{B}$ is
the field at $\Delta\mathbf{S}$. We divide S into many small area elements and calculate
the individual flux through each. Then, the net flux $\phi_B$ is,

$\phi_B = \sum_{\text{all}} \Delta\phi_B = \sum_{\text{all}} \mathbf{B}\cdot\Delta\mathbf{S} = 0$ (5.9)

where ‘all’ stands for ‘all area elements $\Delta\mathbf{S}$’. Compare this with the
Gauss’s law of electrostatics. The flux through a closed surface in that
case is given by

$\sum \mathbf{E}\cdot\Delta\mathbf{S} = \frac{q}{\varepsilon_0}$

where q is the electric charge enclosed by the surface.






The difference between the Gauss’s law of magnetism and that for 
electrostatics is a reflection of the fact that isolated magnetic poles (also 
called monopoles) are not known to exist. There are no sources or sinks 
of B; the simplest magnetic element is a dipole or a current loop. All 
magnetic phenomena can be explained in terms of an arrangement of 
dipoles and/or current loops. 

Thus, Gauss’s law for magnetism is: The net magnetic flux through any 
closed surface is zero. 
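Gauss's law for magnetism can be illustrated numerically. The sketch below sums $\mathbf{B}\cdot\Delta\mathbf{S}$ over small patches of a sphere, as in Eq. (5.9), for the field of an ideal point dipole. The point-dipole formula $\mathbf{B} \propto [3(\mathbf{m}\cdot\hat{\mathbf{r}})\hat{\mathbf{r}} - \mathbf{m}]/r^3$, the units (with $\mu_0/4\pi$ set to 1), the sphere positions and the grid sizes are all assumptions made for this illustration and are not taken from the text.

```python
import math


def dipole_field(m, r):
    """B of a point dipole m at the origin, in units where mu_0/(4 pi) = 1."""
    d = math.sqrt(sum(c * c for c in r))
    m_dot_r = sum(mc * rc for mc, rc in zip(m, r))
    return tuple((3 * m_dot_r * rc / d**2 - mc) / d**3 for rc, mc in zip(r, m))


def net_flux(center, radius, m=(0.0, 0.0, 1.0), n_theta=120, n_phi=240):
    """Midpoint-rule sum of B . dS over a sphere.

    Returns (net flux, sum of |flux| over all patches)."""
    d_theta = math.pi / n_theta
    d_phi = 2 * math.pi / n_phi
    net = 0.0
    gross = 0.0
    for i in range(n_theta):
        theta = (i + 0.5) * d_theta
        for j in range(n_phi):
            phi = (j + 0.5) * d_phi
            # Outward unit normal of the patch
            normal = (math.sin(theta) * math.cos(phi),
                      math.sin(theta) * math.sin(phi),
                      math.cos(theta))
            point = tuple(c + radius * nc for c, nc in zip(center, normal))
            b = dipole_field(m, point)
            d_area = radius**2 * math.sin(theta) * d_theta * d_phi
            flux = sum(bc * nc for bc, nc in zip(b, normal)) * d_area
            net += flux
            gross += abs(flux)
    return net, gross


# One sphere encloses the dipole (off-centre), the other does not.
for label, center in [("enclosing", (0.3, 0.0, 0.2)), ("not enclosing", (0.0, 0.0, 3.0))]:
    net, gross = net_flux(center, 1.0)
    print(f"{label}: net flux / total |flux| = {net / gross:.1e}")
```

In both cases the net flux is a tiny fraction of the total flux crossing the surface, limited only by the grid resolution: as many lines leave the surface as enter it, whether or not the dipole is inside.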


Example 5.6 Many of the diagrams given in Fig. 5.7 show magnetic 
field lines (thick lines in the figure) wrongly. Point out what is wrong 
with them. Some of them may describe electrostatic field lines 
correctly. Point out which ones. 










































Figure 5.7 

Solution 

(a) Wrong. Magnetic field lines can never emanate from a point, as 
shown in figure. Over any closed surface, the net flux of B must 
always be zero, i.e., pictorially as many field lines should seem to 
enter the surface as the number of lines leaving it. The field lines 
shown, in fact, represent electric field of a long positively charged 
wire. The correct magnetic field lines are circling the straight 
conductor, as described in Chapter 4. 

(b) Wrong. Magnetic field lines (like electric field lines) can never 
cross each other, because otherwise the direction of field at the point 
of intersection is ambiguous. There is further error in the figure. 
Magnetostatic field lines can never form closed loops around empty 
space. A closed loop of static magnetic field line must enclose a 
region across which a current is passing. By contrast, electrostatic 
field lines can never form closed loops, neither in empty space, nor 
when the loop encloses charges. 

(c) Right. Magnetic lines are completely confined within a toroid. 
Nothing wrong here in field lines forming closed loops, since each 
loop encloses a region across which a current passes. Note, for 
clarity of figure, only a few field lines within the toroid have been 
shown. Actually, the entire region enclosed by the windings contains 
magnetic field. 

(d) Wrong. Field lines due to a solenoid at its ends and outside 
cannot be so completely straight and confined; such a thing violates 
Ampere’s law. The lines should curve out at both ends, and meet 
eventually to form closed loops. 




(e) Right. These are field lines outside and inside a bar magnet. Note 
carefully the direction of field lines inside. Not all field lines emanate 
out of a north pole (or converge into a south pole). Around both the 
N-pole, and the S-pole, the net flux of the field is zero. 

(f) Wrong. These field lines cannot possibly represent a magnetic 
field. Look at the upper region. All the field lines seem to emanate out 
of the shaded plate. The net flux through a surface surrounding the 
shaded plate is not zero. This is impossible for a magnetic field. The 
given field lines, in fact, show the electrostatic field lines around a 
positively charged upper plate and a negatively charged lower plate. 
The difference between Fig. [5.7(e) and (f)] should be carefully 
grasped. 

(g) Wrong. Magnetic field lines between two pole pieces cannot be 
precisely straight at the ends. Some fringing of lines is inevitable. 
Otherwise, Ampere’s law is violated. This is also true for electric field 
lines. 


Example 5.7 

(a) Magnetic field lines show the direction (at every point) along 
which a small magnetised needle aligns (at the point). Do the 
magnetic field lines also represent the lines of force on a moving 
charged particle at every point? 

(b) Magnetic field lines can be entirely confined within the core of a 
toroid, but not within a straight solenoid. Why? 

(c) If magnetic monopoles existed, how would the Gauss’s law of
magnetism be modified?

(d) Does a bar magnet exert a torque on itself due to its own field? 
Does one element of a current-carrying wire exert a force on another 
element of the same wire? 

(e) Magnetic field arises due to charges in motion. Can a system 
have magnetic moments even though its net charge is zero? 

Solution 

(a) No. The magnetic force is always normal to $\mathbf{B}$ (remember
magnetic force = $q\mathbf{v} \times \mathbf{B}$). It is misleading to call magnetic field lines
lines of force.

(b) If field lines were entirely confined between two ends of a straight 
solenoid, the flux through the cross-section at each end would be 
non-zero. But the flux of field B through any closed surface must 
always be zero. For a toroid, this difficulty is absent because it has no 
‘ends’. 

(c) Gauss’s law of magnetism states that the flux of $\mathbf{B}$ through any
closed surface is always zero: $\oint_S \mathbf{B}\cdot d\mathbf{s} = 0$.

If monopoles existed, the right hand side would be equal to the
monopole (magnetic charge) $q_m$ enclosed by S. [Analogous to
Gauss’s law of electrostatics, $\oint_S \mathbf{B}\cdot d\mathbf{s} = \mu_0 q_m$, where $q_m$ is the
(monopole) magnetic charge enclosed by S.]

(d) No. There is no force or torque on an element due to the field
produced by that element itself. But there is a force (or torque) on an
element due to the field of the other elements of the same wire. (For
the special case of a straight wire, this force is zero.)




(e) Yes. The average of the charge in the system may be zero. Yet,
the mean of the magnetic moments due to various current loops may
not be zero. We will come across such examples in connection with
paramagnetic materials, where atoms have a net dipole moment though
their net charge is zero.


5.4 The Earth’s Magnetism 

Earlier we have referred to the magnetic field of the earth. The strength of
the earth’s magnetic field varies from place to place on the earth’s
surface; its value being of the order of $10^{-5}$ T.

What causes the earth to have a magnetic field is not clear. Originally the 
magnetic field was thought of as arising from a giant bar magnet placed 
approximately along the axis of rotation of the earth and deep in the 
interior. However, this simplistic picture is certainly not correct. The 
magnetic field is now thought to arise due to electrical currents produced 
by convective motion of metallic fluids (consisting mostly of molten 
iron and nickel) in the outer core of the earth. This is known as the 
dynamo effect. 

The magnetic field lines of the earth resemble those of a (hypothetical)
magnetic dipole located at the centre of the earth. The axis of the dipole
does not coincide with the axis of rotation of the earth but is presently
tilted by approximately 11.3° with respect to the latter. In this way of
looking at it, the magnetic poles are located where the magnetic field
lines due to the dipole enter or leave the earth. The location of the north
magnetic pole is at a latitude of 79.74° N and a longitude of 71.8° W, a
place somewhere in north Canada. The magnetic south pole is at 79.74°
S, 108.22° E in Antarctica.


The pole near the geographic north pole of the earth is called the north
magnetic pole. Likewise, the pole near the geographic south pole is
called the south magnetic pole. There is some confusion in the
nomenclature of the poles. If one looks at the magnetic field lines of the
earth (Fig. 5.8), one sees that unlike in the case of a bar magnet, the field
lines go into the earth at the north magnetic pole ($N_m$) and come out from
the south magnetic pole ($S_m$). The convention arose because the
magnetic north was the direction to which the north pole of a magnetic
needle pointed; the north pole of a magnet was so named as it was the
north seeking pole. Thus, in reality, the north magnetic pole behaves like
the south pole of a bar magnet inside the earth and vice versa.



Figure 5.8 The earth as a giant magnetic dipole. 
Example 5.8 The earth’s magnetic field at the equator is
approximately 0.4 G. Estimate the earth’s dipole moment.

Solution From Eq. (5.7), the equatorial magnetic field is,

$B_E = \frac{\mu_0 m}{4\pi r^3}$

We are given that $B_E \approx 0.4$ G $= 4 \times 10^{-5}$ T. For r, we take the radius
of the earth, $6.4 \times 10^6$ m. Hence,

$m = \frac{4 \times 10^{-5} \times (6.4 \times 10^6)^3}{\mu_0/4\pi} = 4 \times 10^2 \times (6.4 \times 10^6)^3$ ($\mu_0/4\pi = 10^{-7}$)

$= 1.05 \times 10^{23}$ A m^2

This is close to the value $8 \times 10^{22}$ A m^2 quoted in geomagnetic texts.
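The estimate in Example 5.8 amounts to inverting Eq. (5.7) for m. A minimal sketch, assuming the same rounded inputs as the example (the function and constant names are illustrative only):

```python
MU0_OVER_4PI = 1e-7    # mu_0 / (4 pi) in T m / A
GAUSS_TO_TESLA = 1e-4  # 1 G = 1e-4 T


def dipole_moment_from_equatorial_field(b_equatorial, r):
    """Invert the magnitude of Eq. (5.7): m = B_E r^3 / (mu_0 / 4 pi)."""
    return b_equatorial * r**3 / MU0_OVER_4PI


b_e = 0.4 * GAUSS_TO_TESLA  # equatorial field of the earth in T
r_earth = 6.4e6             # radius of the earth in m

m_earth = dipole_moment_from_equatorial_field(b_e, r_earth)
print(f"m = {m_earth:.2e} A m^2")
```

Because m scales as r^3, a small change in the assumed radius or field changes the estimate noticeably, which is one reason an order-of-magnitude agreement with the quoted value is all that should be expected.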


5.4.1 Magnetic declination and dip 

Consider a point on the earth’s surface. At such a point, the direction of 
the longitude circle determines the geographic north-south direction, the 
line of longitude towards the north pole being the direction of true north. 
The vertical plane containing the longitude circle and the axis of rotation 
of the earth is called the geographic meridian. In a similar way, one can 
define magnetic meridian of a place as the vertical plane which passes 
through the imaginary line joining the magnetic north and the south poles. 
This plane would intersect the surface of the earth in a longitude-like
circle. A magnetic needle, which is free to swing horizontally, would then
lie in the magnetic meridian and the north pole of the needle would point
towards the magnetic north pole. Since the line joining the magnetic
poles is tilted with respect to the geographic axis of the earth, the
magnetic meridian at a point makes an angle with the geographic meridian.
This, then, is the angle between the true geographic north and the north
shown by a compass needle. This angle is called the magnetic
declination or simply declination (Fig. 5.9).

The declination is greater at higher latitudes and smaller near the
equator. The declination in India is small, it being 0°41′ E at Delhi and
0°58′ W at Mumbai. Thus, at both these places a magnetic needle shows
the true north quite accurately.



Figure 5.9 A magnetic needle free to move in a horizontal plane points toward the magnetic
north-south direction.

There is one more quantity of interest. If a magnetic needle is perfectly
balanced about a horizontal axis so that it can swing in a plane of the
magnetic meridian, the needle would make an angle with the horizontal
(Fig. 5.10). This is known as the angle of dip (also known as inclination).
Thus, dip is the angle that the total magnetic field $\mathbf{B}_E$ of the earth makes
with the surface of the earth. Figure 5.11 shows the magnetic meridian
plane at a point P on the surface of the earth. The plane is a section
through the earth. The total magnetic field at P can be resolved into a
horizontal component $H_E$ and a vertical component $Z_E$. The angle that $\mathbf{B}_E$
makes with $H_E$ is the angle of dip, I.



Figure 5.10 The circle is a section through the earth containing the magnetic meridian. The angle 
between BE and the horizontal component HE is the angle of dip. 







Figure 5.11 The earth’s magnetic field, BE, its horizontal and vertical components, HE and ZE. 
Also shown are the declination, D and the inclination or angle of dip, I. 


In most of the northern hemisphere, the north pole of the dip needle tilts 
downwards. Likewise in most of the southern hemisphere, the south pole 
of the dip needle tilts downwards. 

To describe the magnetic field of the earth at a point on its surface, we
need to specify three quantities, viz., the declination D, the angle of dip or
the inclination I and the horizontal component of the earth’s field $H_E$.
These are known as the elements of the earth’s magnetic field.

Representing the vertical component by $Z_E$, we have

$Z_E = B_E \sin I$ [5.10(a)]

$H_E = B_E \cos I$ [5.10(b)]

which gives,

$\tan I = \frac{Z_E}{H_E}$ [5.10(c)]






What happens to my compass needles at the poles? 

A compass needle consists of a magnetic needle which floats on a 
pivotal point. When the compass is held level, it points along the 
direction of the horizontal component of the earth’s magnetic field at 
the location. Thus, the compass needle would stay along the 
magnetic meridian of the place. In some places on the earth there are 
deposits of magnetic minerals which cause the compass needle to 
deviate from the magnetic meridian. Knowing the magnetic 
declination at a place allows us to correct the compass to determine 
the direction of true north. 















So what happens if we take our compass to the magnetic pole? At 
the poles, the magnetic field lines are converging or diverging 
vertically so that the horizontal component is negligible. If the needle 
is only capable of moving in a horizontal plane, it can point along any 
direction, rendering it useless as a direction finder. What one needs 
in such a case is a dip needle which is a compass pivoted to move in 
a vertical plane containing the magnetic field of the earth. The needle 
of the compass then shows the angle which the magnetic field makes 
with the vertical. At the magnetic poles such a needle will point 
straight down. 


Example 5.9 In the magnetic meridian of a certain place, the
horizontal component of the earth’s magnetic field is 0.26 G and the
dip angle is 60°. What is the magnetic field of the earth at this
location?

Solution

It is given that $H_E$ = 0.26 G. From Fig. 5.11, we have

$\cos 60° = \frac{H_E}{B_E}$

$B_E = \frac{H_E}{\cos 60°} = \frac{0.26}{(1/2)} = 0.52$ G
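The relations in Eq. (5.10) are easy to apply in code. The following sketch (the function name is illustrative) recovers the total field of Example 5.9 from the horizontal component and the dip angle, and also gives the vertical component, which the example does not ask for:

```python
import math


def field_from_horizontal(h_e, dip_deg):
    """Return (B_E, Z_E) from the horizontal component and the angle of dip.

    Uses H_E = B_E cos I and Z_E = B_E sin I, Eqs. 5.10(a) and 5.10(b)."""
    dip = math.radians(dip_deg)
    b_e = h_e / math.cos(dip)
    z_e = b_e * math.sin(dip)
    return b_e, z_e


b_e, z_e = field_from_horizontal(0.26, 60.0)  # H_E in gauss, I in degrees
print(f"B_E = {b_e:.2f} G, Z_E = {z_e:.2f} G")
```

The formula breaks down as the dip approaches 90°, where the horizontal component vanishes; this is the situation at the magnetic poles described in the box above.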


5.5 Magnetisation and Magnetic Intensity 









The earth abounds with a bewildering variety of elements and 
compounds. In addition, we have been synthesising new alloys, 
compounds and even elements. One would like to classify the magnetic 
properties of these substances. In the present section, we define and 
explain certain terms which will help us to carry out this exercise. 

We have seen that a circulating electron in an atom has a magnetic 
moment. In a bulk material, these moments add up vectorially and they 
can give a net magnetic moment which is non-zero. We define 
magnetisation M of a sample to be equal to its net magnetic moment per 
unit volume: 

Earth’s magnetic field 

It must not be assumed that there is a giant bar magnet deep inside the 
earth which is causing the earth’s magnetic field. Although there are large 
deposits of iron inside the earth, it is highly unlikely that a large solid 
block of iron stretches from the magnetic north pole to the magnetic 
south pole. The earth’s core is very hot and molten, and the ions of iron 
and nickel are responsible for earth’s magnetism. This hypothesis seems 
very probable. The Moon, which has no molten core, has no magnetic field;
Venus has a slower rate of rotation and a weaker magnetic field; while
Jupiter, which has the fastest rotation rate among the planets, has a fairly
strong magnetic field. However, the precise mode of these circulating
currents and the energy needed to sustain them are not very well
understood. These are among several open questions which form an
important area of continuing research.

The variation of the earth’s magnetic field with position is also an
interesting area of study. Charged particles emitted by the sun flow
towards the earth and beyond, in a stream called the solar wind. Their
motion is affected by the earth’s magnetic field, and in turn, they affect
the pattern of the earth’s magnetic field. The pattern of magnetic field
near the poles is quite different from that in other regions of the earth.

The variation of earth’s magnetic field with time is no less fascinating. 
There are short term variations taking place over centuries and long term 
variations taking place over a period of a million years. In a span of 240 
years from 1580 to 1820 AD, over which records are available, the 
magnetic declination at London has been found to change by 3.5°, 
suggesting that the magnetic poles inside the earth change position with 
time. On the scale of a million years, the earth’s magnetic field has been
found to reverse its direction. Basalt contains iron, and basalt is emitted 
during volcanic activity. The little iron magnets inside it align themselves 
parallel to the magnetic field at that place as the basalt cools and 
solidifies. Geological studies of basalt containing such pieces of 
magnetised region have provided evidence for the change of direction of 
earth’s magnetic field, several times in the past. 

$\mathbf{M} = \frac{\mathbf{m}_{net}}{V}$ (5.11)

$\mathbf{M}$ is a vector with dimensions $L^{-1}A$ and is measured in units of A m$^{-1}$.

Consider a long solenoid of n turns per unit length and carrying a current
I. The magnetic field in the interior of the solenoid was shown to be given
by

$\mathbf{B}_0 = \mu_0 n I$ (5.12)

If the interior of the solenoid is filled with a material with non-zero
magnetisation, the field inside the solenoid will be greater than $\mathbf{B}_0$. The
net $\mathbf{B}$ field in the interior of the solenoid may be expressed as

$\mathbf{B} = \mathbf{B}_0 + \mathbf{B}_m$ (5.13)

where $\mathbf{B}_m$ is the field contributed by the material core. It turns out that this
additional field $\mathbf{B}_m$ is proportional to the magnetisation $\mathbf{M}$ of the material
and is expressed as

$\mathbf{B}_m = \mu_0 \mathbf{M}$ (5.14)

where $\mu_0$ is the same constant (permeability of vacuum) that appears in
Biot-Savart’s law.

It is convenient to introduce another vector field $\mathbf{H}$, called the magnetic
intensity, which is defined by

$\mathbf{H} = \frac{\mathbf{B}}{\mu_0} - \mathbf{M}$ (5.15)

where $\mathbf{H}$ has the same dimensions as $\mathbf{M}$ and is measured in units of A m$^{-1}$.

Thus, the total magnetic field $\mathbf{B}$ is written as

$\mathbf{B} = \mu_0 (\mathbf{H} + \mathbf{M})$ (5.16)

We repeat our defining procedure. We have partitioned the contribution
to the total magnetic field inside the sample into two parts: one, due to
external factors such as the current in the solenoid. This is represented
by $\mathbf{H}$. The other is due to the specific nature of the magnetic material,
namely $\mathbf{M}$. The latter quantity can be influenced by external factors. This
influence is mathematically expressed as

$\mathbf{M} = \chi \mathbf{H}$ (5.17)

where $\chi$, a dimensionless quantity, is appropriately called the magnetic
susceptibility. It is a measure of how a magnetic material responds to an
external field. Table 5.2 lists $\chi$ for some elements. It is small and positive
for materials, which are called paramagnetic. It is small and negative for
materials, which are termed diamagnetic. In the latter case $\mathbf{M}$ and $\mathbf{H}$ are
opposite in direction. From Eqs. (5.16) and (5.17) we obtain,

$\mathbf{B} = \mu_0 (1 + \chi) \mathbf{H}$ (5.18)

$= \mu_0 \mu_r \mathbf{H}$

$= \mu \mathbf{H}$ (5.19)

where $\mu_r = 1 + \chi$ is a dimensionless quantity called the relative magnetic
permeability of the substance. It is the analog of the dielectric constant in
electrostatics. The magnetic permeability of the substance is $\mu$ and it has
the same dimensions and units as $\mu_0$;

$\mu = \mu_0 \mu_r = \mu_0 (1 + \chi)$.

The three quantities $\chi$, $\mu_r$ and $\mu$ are interrelated and only one of
them is independent. Given one, the other two may be easily determined.
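Since only one of the three quantities is independent, a single small function is enough to obtain the other two from the susceptibility. The sketch below is illustrative: the function name is invented here, and the sample values are round numbers of the typical sizes discussed in this chapter rather than data for particular substances.

```python
import math

MU0 = 4 * math.pi * 1e-7  # permeability of vacuum in T m / A


def permeabilities(chi):
    """Return (mu_r, mu) from the susceptibility, using mu_r = 1 + chi."""
    mu_r = 1.0 + chi
    return mu_r, MU0 * mu_r


# Illustrative susceptibilities (assumed round numbers, not measured data)
samples = [
    ("weakly diamagnetic", -1.0e-5),
    ("weakly paramagnetic", 2.0e-5),
    ("perfectly diamagnetic", -1.0),
]
for label, chi in samples:
    mu_r, mu = permeabilities(chi)
    print(f"{label}: chi = {chi:+.1e}, mu_r = {mu_r:.6f}, mu = {mu:.6e} T m/A")
```

For the weakly magnetic cases $\mu_r$ differs from 1 only in the fifth decimal place, which is why the field inside such materials changes by about one part in $10^5$.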


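Table 5.2 Magnetic susceptibility of some elements at 300 K

Diamagnetic substance | $\chi$ | Paramagnetic substance | $\chi$
Bismuth | $-1.66 \times 10^{-5}$ | Aluminium | $2.3 \times 10^{-5}$
Copper | $-9.8 \times 10^{-6}$ | Calcium | $1.9 \times 10^{-5}$
Diamond | $-2.2 \times 10^{-5}$ | Chromium | $2.7 \times 10^{-4}$
Gold | $-3.6 \times 10^{-5}$ | Lithium | $2.1 \times 10^{-5}$
Lead | $-1.7 \times 10^{-5}$ | Magnesium | $1.2 \times 10^{-5}$
Mercury | $-2.9 \times 10^{-5}$ | Niobium | $2.6 \times 10^{-5}$
Nitrogen (STP) | $-5.0 \times 10^{-6}$ | Oxygen (STP) | $2.1 \times 10^{-6}$
Silver | $-2.6 \times 10^{-5}$ | Platinum | $2.9 \times 10^{-4}$
Silicon | $-4.2 \times 10^{-6}$ | Tungsten | $6.8 \times 10^{-5}$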




Example 5.10 A solenoid has a core of a material with relative
permeability 400. The windings of the solenoid are insulated from the
core and carry a current of 2 A. If the number of turns is 1000 per
metre, calculate (a) H, (b) B, (c) M and (d) the magnetising current
$I_M$.

Solution

(a) The field H is independent of the material of the core, and is

$H = nI = 1000 \times 2.0 = 2 \times 10^3$ A/m.

(b) The magnetic field B is given by

$B = \mu_r \mu_0 H$

$= 400 \times 4\pi \times 10^{-7}$ (N/A^2) $\times\ 2 \times 10^3$ (A/m)

= 1.0 T

(c) Magnetisation is given by

$M = (B - \mu_0 H)/\mu_0 = (\mu_r - 1)H = 399 \times H \approx 8 \times 10^5$ A/m

(d) The magnetising current $I_M$ is the additional current that needs to
be passed through the windings of the solenoid in the absence of the
core which would give a B value as in the presence of the core. Thus
$B = \mu_0 n (I + I_M)$. Using I = 2 A, B = 1 T, we get $I_M$ = 794 A.
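The four results of Example 5.10 follow from Eqs. (5.12) to (5.19) and can be reproduced with the sketch below (the function name is illustrative). It keeps B unrounded, so the magnetising current comes out as $(\mu_r - 1)I = 798$ A; the 794 A quoted in the solution results from first rounding B to 1 T.

```python
import math

MU0 = 4 * math.pi * 1e-7  # permeability of vacuum in T m / A


def solenoid_with_core(n, current, mu_r):
    """Return (H, M, B, I_M) for a long solenoid filled with a linear core.

    n is the number of turns per metre and current is in amperes."""
    h = n * current                  # H = n I, independent of the core
    m = (mu_r - 1.0) * h             # M = chi H with chi = mu_r - 1
    b = mu_r * MU0 * h               # B = mu_r mu_0 H
    i_m = b / (MU0 * n) - current    # from B = mu_0 n (I + I_M)
    return h, m, b, i_m


h, m, b, i_m = solenoid_with_core(n=1000, current=2.0, mu_r=400)
print(f"H = {h:.1e} A/m, M = {m:.2e} A/m, B = {b:.2f} T, I_M = {i_m:.0f} A")
```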


5.6 Magnetic Properties of Materials 

The discussion in the previous section helps us to classify materials as
diamagnetic, paramagnetic or ferromagnetic. In terms of the susceptibility
$\chi$, a material is diamagnetic if $\chi$ is negative, para- if $\chi$ is positive and
small, and ferro- if $\chi$ is large and positive.

A glance at Table 5.3 gives one a better feeling for these
materials. Here $\varepsilon$ is a small positive number introduced to quantify
paramagnetic materials. Next, we describe these materials in some
detail.

Table 5.3

Diamagnetic | Paramagnetic | Ferromagnetic
$-1 \le \chi < 0$ | $0 < \chi < \varepsilon$ | $\chi \gg 1$
$0 \le \mu_r < 1$ | $1 < \mu_r < 1 + \varepsilon$ | $\mu_r \gg 1$



5.6.1 Diamagnetism 

Diamagnetic substances are those which have tendency to move from 
stronger to the weaker part of the external magnetic field. In other words, 
unlike the way a magnet attracts metals like iron, it would repel a 
diamagnetic substance. 

Figure 5.12(a) shows a bar of diamagnetic material placed in an external
magnetic field. The field lines are repelled or expelled and the field inside
the material is reduced. In most cases, as is evident from Table 5.2, this
reduction is slight, being one part in $10^5$. When placed in a
non-uniform magnetic field, the bar will tend to move from high to low
field.

The simplest explanation for diamagnetism is as follows. Electrons in an
atom orbiting around the nucleus possess orbital angular momentum. These
orbiting electrons are equivalent to current-carrying loops and thus
possess orbital magnetic moment. Diamagnetic substances are the ones
in which the resultant magnetic moment in an atom is zero. When a magnetic
field is applied, those electrons having orbital magnetic moment in the
same direction slow down and those in the opposite direction speed up.
This happens due to induced current in accordance with Lenz’s law
which you will study in Chapter 6. Thus, the substance develops a net
magnetic moment in the direction opposite to that of the applied field and
hence repulsion.




Figure 5.12 Behaviour of magnetic field lines near a (a) diamagnetic,
(b) paramagnetic substance.


Some diamagnetic materials are bismuth, copper, lead, silicon, nitrogen
(at STP), water and sodium chloride. Diamagnetism is present in all
substances. However, the effect is so weak in most cases that it gets
masked by other effects like paramagnetism, ferromagnetism, etc.

The most exotic diamagnetic materials are superconductors. These are
metals, cooled to very low temperatures, which exhibit both perfect
conductivity and perfect diamagnetism. Here the field lines are
completely expelled! $\chi = -1$ and $\mu_r = 0$. A superconductor repels a
magnet and (by Newton’s third law) is repelled by the magnet. The
phenomenon of perfect diamagnetism in superconductors is called the
Meissner effect, after the name of its discoverer. Superconducting
magnets can be gainfully exploited in a variety of situations, for example,
for running magnetically levitated superfast trains.

5.6.2 Paramagnetism 

Paramagnetic substances are those which get weakly magnetised when 
placed in an external magnetic field. They have tendency to move from a 
region of weak magnetic field to strong magnetic field, i.e., they get 
weakly attracted to a magnet. 

The individual atoms (or ions or molecules) of a paramagnetic material
possess a permanent magnetic dipole moment of their own. On account
of the ceaseless random thermal motion of the atoms, no net
magnetisation is seen. In the presence of an external field $\mathbf{B}_0$, which is
strong enough, and at low temperatures, the individual atomic dipole
moments can be made to align and point in the same direction as $\mathbf{B}_0$.

Figure 5.12(b) shows a bar of paramagnetic material placed in an
external field. The field lines get concentrated inside the material, and
the field inside is enhanced. In most cases, as is evident from Table 5.2,
this enhancement is slight, being one part in $10^5$. When placed in a
non-uniform magnetic field, the bar will tend to move from weak field to
strong.
Some paramagnetic materials are aluminium, sodium, calcium, oxygen
(at STP) and copper chloride. Experimentally, one finds that the
magnetisation of a paramagnetic material is inversely proportional to the
absolute temperature T,

$M = C\frac{B_0}{T}$ [5.20(a)]

or equivalently, using Eqs. (5.12) and (5.17)

$\chi = C\frac{\mu_0}{T}$ [5.20(b)]

This is known as Curie’s law, after its discoverer Pierre Curie (1859-
1906). The constant C is called Curie’s constant. Thus, for a
paramagnetic material both $\chi$ and $\mu_r$ depend not only on the material, but
also (in a simple fashion) on the sample temperature. As the field is
increased or the temperature is lowered, the magnetisation increases
until it reaches the saturation value $M_s$, at which point all the dipoles are
perfectly aligned with the field. Beyond this, Curie’s law [Eq. (5.20)] is no
longer valid.
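Curie's law implies a simple scaling rule: if the susceptibility is known at one temperature, its value at another follows from $\chi T$ = constant. The sketch below applies this rule; the function name and the sample numbers are hypothetical, and the result is meaningful only well below saturation, where Eq. (5.20) holds.

```python
def curie_susceptibility(chi_ref, t_ref, t):
    """Susceptibility at absolute temperature t, given chi_ref at t_ref.

    Assumes Curie's law, chi = C mu_0 / T, so that chi * T is constant."""
    if t <= 0 or t_ref <= 0:
        raise ValueError("absolute temperatures must be positive")
    return chi_ref * t_ref / t


# Hypothetical paramagnetic sample with chi = 2.0e-5 at 300 K
for temperature in (300.0, 150.0, 75.0):
    chi = curie_susceptibility(2.0e-5, 300.0, temperature)
    print(f"T = {temperature:5.1f} K  ->  chi = {chi:.1e}")
```

Halving the absolute temperature doubles the susceptibility, as the inverse proportionality in Eq. 5.20(b) requires.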

5.6.3 Ferromagnetism 

Ferromagnetic substances are those which get strongly magnetised
when placed in an external magnetic field. They have a strong tendency to
move from a region of weak magnetic field to strong magnetic field, i.e.,
they get strongly attracted to a magnet.






Figure 5.13 (a) Randomly oriented domains, (b) Aligned domains.


The individual atoms (or ions or molecules) in a ferromagnetic material
possess a dipole moment as in a paramagnetic material. However, they
interact with one another in such a way that they spontaneously align
themselves in a common direction over a macroscopic volume called a
domain. The explanation of this cooperative effect requires quantum
mechanics and is beyond the scope of this textbook. Each domain has a
net magnetisation. Typical domain size is 1 mm and the domain contains
about $10^{11}$ atoms. In the first instant, the magnetisation varies randomly
from domain to domain and there is no bulk magnetisation. This is shown
in Fig. 5.13(a). When we apply an external magnetic field $\mathbf{B}_0$, the
domains orient themselves in the direction of $\mathbf{B}_0$ and simultaneously the
domains oriented in the direction of $\mathbf{B}_0$ grow in size. This existence of
domains and their motion in $\mathbf{B}_0$ are not speculations. One may observe
this under a microscope after sprinkling a liquid suspension of powdered
ferromagnetic substance on the sample. The motion of the suspension can be
observed. Figure 5.13(b) shows the situation when the domains have
aligned and amalgamated to form a single ‘giant’ domain.

Thus, in a ferromagnetic material the field lines are highly concentrated. 
In non-uniform magnetic field, the sample tends to move towards the 
region of high field. We may wonder as to what happens when the 
external field is removed. In some ferromagnetic materials the 
magnetisation persists. Such materials are called hard magnetic 
materials or hard ferromagnets. Alnico, an alloy of iron, aluminium, nickel, 
cobalt and copper, is one such material. The naturally occurring 
lodestone is another. Such materials form permanent magnets to be used 
among other things as a compass needle. On the other hand, there is a 
class of ferromagnetic materials in which the magnetisation disappears 
on removal of the external field. Soft iron is one such material. 
Appropriately enough, such materials are called soft ferromagnetic 
materials. There are a number of elements, which are ferromagnetic: 
iron, cobalt, nickel, gadolinium, etc. The relative magnetic permeability is 
> 1000 ! 

The ferromagnetic property depends on temperature. At high enough
temperature, a ferromagnet becomes a paramagnet. The domain
structure disintegrates with temperature. This disappearance of
magnetisation with temperature is gradual. It is a phase transition
reminding us of the melting of a solid crystal. The temperature of
transition from ferromagnetism to paramagnetism is called the Curie
temperature $T_c$. Table 5.4 lists the Curie temperature of certain
ferromagnets. The susceptibility above the Curie temperature, i.e., in the
paramagnetic phase, is described by,

$\chi = \frac{C}{T - T_c}$ $(T > T_c)$ (5.21)

Table 5.4 Curie temperature $T_c$ of some ferromagnetic materials

Material | $T_c$ (K)
Cobalt | 1394
Iron | 1043
Fe2O3 | 893
Nickel | 631
Gadolinium | 317
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Example 5.11 A domain in ferromagnetic iron is in the form of a cube 
of side length 1pm. Estimate the number of iron atoms in the domain 
and the maximum possible dipole moment and magnetisation of the 
domain. The molecular mass of iron is 55 g/mole and its density 

is 7.9 g/cm 3 . Assume that each iron atom has a dipole moment 
of 9.27x10- 24 Am 2 . 

Solution The volume of the cubic domain is 
V = (10 -6 m) 3 = 1Cr 18 m 3 = 10 -12 cm 3 






Its mass is volume x density = 7.9 g cm -3 x IO -12 cm 3 = 7.9 x IO -12 g 

Table 5.4 Curie temperature Tc of some 
ferromagnetic materials 

It is given that Avagadro number (6.023 x 10 23 ) of iron atoms have a 
mass of 55 g. Hence, the number of atoms in the domain is 

7.9x10 x 0.023x10- 

jv = 

55 

= 8.65 x 10 10 atoms 

The maximum possible dipole moment $m_{max}$ is achieved for the
(unrealistic) case when all the atomic moments are perfectly aligned.
Thus,

$$m_{max} = (8.65 \times 10^{10}) \times (9.27 \times 10^{-24}) = 8.0 \times 10^{-13}\ \text{A m}^2$$

The consequent magnetisation is

$$M_{max} = \frac{m_{max}}{\text{domain volume}} = \frac{8.0 \times 10^{-13}\ \text{A m}^2}{10^{-18}\ \text{m}^3} = 8.0 \times 10^{5}\ \text{A m}^{-1}$$
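The arithmetic of Example 5.11 can be checked with a short script. This is only a sketch that repeats the steps of the solution; the variable names are illustrative and the numerical inputs are those given in the example.

```python
AVOGADRO = 6.023e23       # atoms per mole, value used in the example
ATOMIC_MASS_FE = 55.0     # g per mole, as given
DENSITY_FE = 7.9          # g per cm^3, as given
ATOMIC_MOMENT = 9.27e-24  # A m^2 per iron atom, as given

side_m = 1e-6                     # cube side of 1 micrometre
volume_m3 = side_m ** 3           # domain volume in m^3
volume_cm3 = volume_m3 * 1e6      # 1 m^3 = 10^6 cm^3
mass_g = DENSITY_FE * volume_cm3  # mass of the domain in grams

n_atoms = mass_g * AVOGADRO / ATOMIC_MASS_FE  # number of atoms in the domain
m_max = n_atoms * ATOMIC_MOMENT               # all moments aligned, A m^2
M_max = m_max / volume_m3                     # magnetisation, A/m

print(f"N = {n_atoms:.2e} atoms")
print(f"m_max = {m_max:.1e} A m^2")
print(f"M_max = {M_max:.1e} A/m")
```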

The relation between B and H in ferromagnetic materials is complex. It is
often not linear and it depends on the magnetic history of the sample.
Figure 5.14 depicts the behaviour of the material as we take it through
one cycle of magnetisation. Let the material be unmagnetised initially. We
place it in a solenoid and increase the current through the solenoid. The
magnetic field B in the material rises and saturates as depicted in the
curve Oa. This behaviour represents the alignment and merger of
domains until no further enhancement is possible. It is pointless to
increase the current (and hence the magnetic intensity H) beyond this.
Next, we decrease H and reduce it to zero. At H = 0, B ≠ 0. This is
represented by the curve ab. The value of B at H = 0 is called retentivity
or remanence. In Fig. 5.14, $B_R \sim 1.2$ T, where the subscript R denotes
retentivity. The domains are not completely randomised even though the
external driving field has been removed. Next, the current in the solenoid
is reversed and slowly increased. Certain domains are flipped until the
net field inside stands nullified. This is represented by the curve bc. The
value of H at c is called coercivity. In Fig. 5.14, $H_c \sim -90$ A m$^{-1}$. As the
reversed current is increased in magnitude, we once again obtain
saturation. The curve cd depicts this. The saturated magnetic field is
$B_s \sim 1.5$ T. Next, the current is reduced (curve de) and reversed (curve ea).
The cycle repeats itself. Note that the curve Oa does not retrace itself as
H is reduced. For a given value of H, B is not unique but depends on the
previous history of the sample. This phenomenon is called hysteresis. The
word hysteresis means lagging behind (and not ‘history’).









Figure 5.14 The magnetic hysteresis loop is the B-H curve for ferromagnetic materials. 


5.7 Permanent Magnets and Electromagnets 

Substances which at room temperature retain their ferromagnetic 
property for a long period of time are called permanent magnets. 
Permanent magnets can be made in a variety of ways. One can hold an 
iron rod in the north-south direction and hammer it repeatedly. The 
method is illustrated in Fig. 5.15. The illustration is from a 400-year-old
book, to emphasise that the making of permanent magnets is an old art.
One can also hold a steel rod and stroke it with one end of a bar magnet
a large number of times, always in the same sense, to make a permanent
magnet.

An efficient way to make a permanent magnet is to place a ferromagnetic 
rod in a solenoid and pass a current. The magnetic field of the solenoid 
magnetises the rod. 

The hysteresis curve (Fig. 5.14) allows us to select suitable materials for 
permanent magnets. The material should have high retentivity so that the 
magnet is strong and high coercivity so that the magnetisation is not 
erased by stray magnetic fields, temperature fluctuations or minor 
mechanical damage. Further, the material should have a high 
permeability. Steel is one favoured choice. It has a slightly smaller
retentivity than soft iron but this is outweighed by the much smaller 
coercivity of soft iron. Other suitable materials for permanent magnets 
are alnico, cobalt steel and ticonal. 



Figure 5.15 A blacksmith forging a permanent magnet by striking a red-hot rod of iron kept in the
north-south direction with a hammer. The sketch is recreated from an illustration in De Magnete, a
work published in 1600 and authored by William Gilbert, the court physician to Queen Elizabeth of
England.

Cores of electromagnets are made of ferromagnetic materials which have
high permeability and low retentivity. Soft iron is a suitable material for
electromagnets. On placing a soft iron rod in a solenoid and passing a
current, we increase the magnetism of the solenoid a thousandfold.
When we switch off the solenoid current, the magnetism is effectively
switched off since the soft iron core has a low retentivity. The
arrangement is shown in Fig. 5.16.



Figure 5.16 A soft iron core in solenoid acts as an electromagnet. 




























India’s Magnetic Field: 
http://www.iigm.res.in 

In certain applications, the material goes through an ac cycle of 
magnetisation for a long period. This is the case in transformer cores and 
telephone diaphragms. The hysteresis curve of such materials must be 
narrow. The energy dissipated and the heating will consequently be 
small. The material must have a high resistivity to lower eddy current 
losses. You will study about eddy currents in Chapter 6. 
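The link between a narrow loop and low dissipation can be made concrete, since the area enclosed by the B-H loop gives the energy dissipated per unit volume in one cycle. The sketch below approximates a loop by a six-sided polygon and computes its area with the shoelace formula. The retentivity, coercivity and saturation values are those quoted for Fig. 5.14; the saturating intensity `H_SAT`, the polygonal shape and the function names are assumptions made only for illustration.

```python
def make_loop(h_sat, b_sat, b_r, h_c):
    """Six-vertex polygonal approximation of a B-H loop, as (H, B) points."""
    return [
        (h_sat, b_sat),    # a: saturation
        (0.0, b_r),        # b: retentivity at H = 0
        (-h_c, 0.0),       # c: coercivity at B = 0
        (-h_sat, -b_sat),  # d: reverse saturation
        (0.0, -b_r),       # e
        (h_c, 0.0),        # f
    ]


def loop_area(points):
    """Area of a closed polygon (shoelace formula); here in J per m^3 per cycle."""
    total = 0.0
    n = len(points)
    for i in range(n):
        h1, b1 = points[i]
        h2, b2 = points[(i + 1) % n]
        total += h1 * b2 - h2 * b1
    return abs(total) / 2.0


H_SAT = 200.0  # A/m, hypothetical value (not given in the text)

wide = loop_area(make_loop(H_SAT, 1.5, 1.2, 90.0))   # coercivity as in Fig. 5.14
narrow = loop_area(make_loop(H_SAT, 1.5, 1.2, 9.0))  # hypothetical low-coercivity material

print("wide loop:", wide, "J/m^3 per cycle")
print("narrow loop:", narrow, "J/m^3 per cycle")
```

With everything else held fixed, reducing the coercivity shrinks the enclosed area and hence the heat generated per cycle, which is why transformer cores favour such materials.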

Electromagnets are used in electric bells, loudspeakers and telephone 
diaphragms. Giant electromagnets are used in cranes to lift machinery, 
and bulk quantities of iron and steel. 


Mapping India’s magnetic field 

Because of its practical application in prospecting, communication,
and navigation, the magnetic field of the earth is mapped by most
nations with an accuracy comparable to geographical mapping. In
India over a dozen observatories exist, extending from Trivandrum
(now Thiruvananthapuram) in the south to Gulmarg in the north.
These observatories work under the aegis of the Indian Institute of
Geomagnetism (IIG), in Colaba, Mumbai. The IIG grew out of the
Colaba and Alibag observatories and was formally established in
1971. The IIG monitors (via its nation-wide observatories) the
geomagnetic fields and fluctuations on land, under the ocean
and in space. Its services are used by the Oil and Natural Gas
Corporation Ltd. (ONGC), the National Institute of Oceanography
(NIO) and the Indian Space Research Organisation (ISRO). It is a
part of the world-wide network which ceaselessly updates the
geomagnetic data. Now India has a permanent station called
Gangotri.


Summary 

1. The science of magnetism is old. It has been known since ancient 
times that magnetic materials tend to point in the north-south 
direction; like magnetic poles repel and unlike ones attract; and 
cutting a bar magnet in two leads to two smaller magnets. Magnetic 
poles cannot be isolated. 

2. When a bar magnet of dipole moment m is placed in a uniform 
magnetic field B, 

(a) the force on it is zero, 

(b) the torque on it is m × B,

(c) its potential energy is −m·B, where we choose the zero of energy
at the orientation when m is perpendicular to B.

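3. Consider a bar magnet of size $l$ and magnetic moment m. At a
distance $r$ from its mid-point, where $r \gg l$, the magnetic field B due to
this bar is

$$\mathbf{B} = \frac{\mu_0 \mathbf{m}}{2\pi r^3} \quad \text{(along the axis)}$$

$$\mathbf{B} = -\frac{\mu_0 \mathbf{m}}{4\pi r^3} \quad \text{(along the equator)}$$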
4. Gauss’s law for magnetism states that the net magnetic flux
through any closed surface is zero:

$$\phi_B = \sum_{\text{all area elements } \Delta\mathbf{S}} \mathbf{B} \cdot \Delta\mathbf{S} = 0$$

5. The earth’s magnetic field resembles that of a (hypothetical) 
magnetic dipole located at the centre of the earth. The pole near the 
geographic north pole of the earth is called the north magnetic pole. 
Similarly, the pole near the geographic south pole is called the south 
magnetic pole. This dipole is aligned making a small angle with the 
rotation axis of the earth. The magnitude of the field on the earth’s
surface is $\approx 4 \times 10^{-5}$ T.

6. Three quantities are needed to specify the magnetic field of the 
earth on its surface - the horizontal component, the magnetic 
declination, and the magnetic dip. These are known as the elements 
of the earth’s magnetic field. 

7. Consider a material placed in an external magnetic field $\mathbf{B}_0$. The
magnetic intensity is defined as

$$\mathbf{H} = \frac{\mathbf{B}_0}{\mu_0}$$

The magnetisation M of the material is its dipole moment per unit
volume.

The magnetic field B in the material is

$$\mathbf{B} = \mu_0 (\mathbf{H} + \mathbf{M})$$

8. For a linear material $\mathbf{M} = \chi \mathbf{H}$, so that $\mathbf{B} = \mu \mathbf{H}$, and $\chi$ is called the
magnetic susceptibility of the material. The three quantities, $\chi$, the
relative magnetic permeability $\mu_r$, and the magnetic permeability $\mu$,
are related as follows:

$$\mu = \mu_0 \mu_r$$

$$\mu_r = 1 + \chi$$

9. Magnetic materials are broadly classified as diamagnetic,
paramagnetic, and ferromagnetic. For diamagnetic materials $\chi$ is
negative and small, and for paramagnetic materials it is positive and
small. Ferromagnetic materials have large $\chi$ and are characterised by
a non-linear relation between B and H. They show the property of
hysteresis.

10. Substances which, at room temperature, retain their
ferromagnetic property for a long period of time are called permanent
magnets.
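The relations for a linear material in points 7 and 8 of the summary can be checked against each other numerically. The sketch below is illustrative only: the function name and the sample values of susceptibility and magnetic intensity are assumptions, and the value of $\mu_0$ is the conventional $4\pi \times 10^{-7}$ T m A$^{-1}$.

```python
import math

MU_0 = 4 * math.pi * 1e-7  # T m / A, permeability of free space


def linear_material(chi, h):
    """Return (mu_r, mu, M, B) for a linear material with susceptibility chi."""
    mu_r = 1.0 + chi    # relative permeability
    mu = MU_0 * mu_r    # permeability
    m = chi * h         # magnetisation M = chi H
    b = MU_0 * (h + m)  # field in the material, B = mu_0 (H + M)
    return mu_r, mu, m, b


H_SAMPLE = 1000.0  # A/m, hypothetical magnetic intensity

# Hypothetical susceptibilities: weakly diamagnetic, weakly paramagnetic,
# and the perfect diamagnet (chi = -1).
for chi in (-1e-5, 1e-5, -1.0):
    mu_r, mu, m, b = linear_material(chi, H_SAMPLE)
    # B = mu_0 (H + M) and B = mu H must agree for a linear material.
    print(chi, mu_r, b, mu * H_SAMPLE)
```

For $\chi = -1$ the field B inside the material vanishes, which is the perfect diamagnet case.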
Points to Ponder

1. A satisfactory understanding of magnetic phenomena in terms of
moving charges/currents was arrived at after 1800 AD. But 
technological exploitation of the directional properties of magnets 
predates this scientific understanding by two thousand years. Thus, 
scientific understanding is not a necessary condition for engineering 
applications. Ideally, science and engineering go hand-in-hand, one 
leading and assisting the other in tandem. 

2. Magnetic monopoles do not exist. If you slice a magnet in half, you
get two smaller magnets. On the other hand, isolated positive and
negative charges exist. There exists a smallest unit of charge, for
example, the electronic charge with value $|e| = 1.6 \times 10^{-19}$ C. All other
charges are integral multiples of this smallest unit charge. In other
words, charge is quantised. We do not know why magnetic
monopoles do not exist or why electric charge is quantised.

3. A consequence of the fact that magnetic monopoles do not exist is 
that the magnetic field lines are continuous and form closed loops. In 
contrast, the electrostatic lines of force begin on a positive charge 
and terminate on the negative charge (or fade out at infinity). 

4. The earth’s magnetic field is not due to a huge bar magnet inside 
it. The earth’s core is hot and molten. Perhaps convective currents in 
this core are responsible for the earth’s magnetic field. As to what 
‘dynamo’ effect sustains this current, and why the earth’s field 
reverses polarity every million years or so, we do not know. 

5. A minuscule difference in the value of $\chi$, the magnetic susceptibility,
yields radically different behaviour: diamagnetic versus paramagnetic.
For diamagnetic materials $\chi = -10^{-5}$ whereas $\chi = +10^{-5}$ for
paramagnetic materials.

6. There exists a perfect diamagnet, namely, a superconductor. This
is a metal at very low temperatures. In this case $\chi = -1$, $\mu_r = 0$, $\mu = 0$.
The external magnetic field is totally expelled. Interestingly, this
material is also a perfect conductor. However, there exists no
classical theory which ties these two properties together. A
quantum-mechanical theory by Bardeen, Cooper, and Schrieffer (BCS theory)
explains these effects. The BCS theory was proposed in 1957 and
was eventually recognised by a Nobel Prize in physics in 1972.

7. The phenomenon of magnetic hysteresis is reminiscent of similar
behaviour concerning the elastic properties of materials. Strain
may not be proportional to stress; here H and B (or M) are not
linearly related. The stress-strain curve exhibits hysteresis and the
area enclosed by it represents the energy dissipated per unit volume.
A similar interpretation can be given to the B-H magnetic hysteresis
curve.

8. Diamagnetism is universal. It is present in all materials. But it
is weak and hard to detect if the substance is para- or ferromagnetic.

9. We have classified materials as diamagnetic, paramagnetic, and 
ferromagnetic. However, there exist additional types of magnetic 
material such as ferrimagnetic, anti-ferromagnetic, spin glass, etc. 
with properties which are exotic and mysterious. 

Exercises 

5.1 Answer the following questions regarding earth’s magnetism: 

(a) A vector needs three quantities for its specification. Name the 
three independent quantities conventionally used to specify the 
earth’s magnetic field. 

(b) The angle of dip at a location in southern India is about 18°. 
Would you expect a greater or smaller dip angle in Britain? 

(c) If you made a map of magnetic field lines at Melbourne in 
Australia, would the lines seem to go into the ground or come out of 
the ground? 

(d) In which direction would a compass free to move in the vertical 
plane point to, if located right on the geomagnetic north or south 
pole? 

(e) The earth’s field, it is claimed, roughly approximates the field due
to a dipole of magnetic moment $8 \times 10^{22}$ J T$^{-1}$ located at its centre.
Check the order of magnitude of this number in some way.

(f) Geologists claim that besides the main magnetic N-S poles, there
are several local poles on the earth’s surface oriented in different
directions. How is such a thing possible at all?

5.2 Answer the following questions: 

(a) The earth’s magnetic field varies from point to point in space. 
Does it also change with time? If so, on what time scale does it 
change appreciably? 

(b) The earth’s core is known to contain iron. Yet geologists do not 
regard this as a source of the earth’s magnetism. Why? 

(c) The charged currents in the outer conducting regions of the 
earth’s core are thought to be responsible for earth’s magnetism. 
What might be the ‘battery’ (i.e., the source of energy) to sustain 
these currents? 

(d) The earth may have even reversed the direction of its field several 
times during its history of 4 to 5 billion years. How can geologists 
know about the earth’s field in such distant past? 

(e) The earth’s field departs from its dipole shape substantially at 
large distances (greater than about 30,000 km). What agencies may 
be responsible for this distortion? 

(f) Interstellar space has an extremely weak magnetic field of the
order of $10^{-12}$ T. Can such a weak field be of any significant
consequence? Explain.

[Note: Exercise 5.2 is meant mainly to arouse your curiosity. Answers 
to some questions above are tentative or unknown. Brief answers 
wherever possible are given at the end. For details, you should 
consult a good text on geomagnetism.] 

5.3 A short bar magnet placed with its axis at 30° with a uniform
external magnetic field of 0.25 T experiences a torque of magnitude
equal to $4.5 \times 10^{-2}$ J. What is the magnitude of magnetic moment of
the magnet?

5.4 A short bar magnet of magnetic moment m = 0.32 J T$^{-1}$ is placed
in a uniform magnetic field of 0.15 T. If the bar is free to rotate in the 
plane of the field, which orientation would correspond to its (a) stable, 
and (b) unstable equilibrium? What is the potential energy of the 
magnet in each case? 

5.5 A closely wound solenoid of 800 turns and area of cross section
$2.5 \times 10^{-4}$ m$^2$ carries a current of 3.0 A. Explain the sense in which
the solenoid acts like a bar magnet. What is its associated magnetic
moment?

5.6 If the solenoid in Exercise 5.5 is free to turn about the vertical 
direction and a uniform horizontal magnetic field of 0.25 T is applied, 
what is the magnitude of torque on the solenoid when its axis makes 
an angle of 30° with the direction of applied field? 

5.7 A bar magnet of magnetic moment 1.5 J T$^{-1}$ lies aligned with the
direction of a uniform magnetic field of 0.22 T. 

(a) What is the amount of work required by an external torque to turn 
the magnet so as to align its magnetic moment: (i) normal to the field 
direction, (ii) opposite to the field direction? 

(b) What is the torque on the magnet in cases (i) and (ii)? 

5.8 A closely wound solenoid of 2000 turns and area of cross-section
$1.6 \times 10^{-4}$ m$^2$, carrying a current of 4.0 A, is suspended through its
centre allowing it to turn in a horizontal plane.

(a) What is the magnetic moment associated with the solenoid? 

(b) What is the force and torque on the solenoid if a uniform
horizontal magnetic field of $7.5 \times 10^{-2}$ T is set up at an angle of 30°
with the axis of the solenoid?

5.9 A circular coil of 16 turns and radius 10 cm carrying a current of
0.75 A rests with its plane normal to an external field of magnitude
$5.0 \times 10^{-2}$ T. The coil is free to turn about an axis in its plane
perpendicular to the field direction. When the coil is turned slightly
and released, it oscillates about its stable equilibrium with a
frequency of 2.0 s$^{-1}$. What is the moment of inertia of the coil about
its axis of rotation?

5.10 A magnetic needle free to rotate in a vertical plane parallel to the 
magnetic meridian has its north tip pointing down at 22° with the 
horizontal. The horizontal component of the earth’s magnetic field at 
the place is known to be 0.35 G. Determine the magnitude of the 
earth’s magnetic field at the place. 

5.11 At a certain location in Africa, a compass points 12° west of the 
geographic north. The north tip of the magnetic needle of a dip circle 
placed in the plane of magnetic meridian points 60° above the 
horizontal. The horizontal component of the earth’s field is measured 
to be 0.16 G. Specify the direction and magnitude of the earth’s field 
at the location. 

5.12 A short bar magnet has a magnetic moment of 0.48 J T$^{-1}$. Give
the direction and magnitude of the magnetic field produced by the 
magnet at a distance of 10 cm from the centre of the magnet on (a) 
the axis, (b) the equatorial lines (normal bisector) of the magnet. 

5.13 A short bar magnet placed in a horizontal plane has its axis
aligned along the magnetic north-south direction. Null points are
found on the axis of the magnet at 14 cm from the centre of the
magnet. The earth’s magnetic field at the place is 0.36 G and the
angle of dip is zero. What is the total magnetic field on the normal
bisector of the magnet at the same distance as the null point (i.e., 14
cm) from the centre of the magnet? (At null points, the field due to a
magnet is equal and opposite to the horizontal component of the earth’s
magnetic field.)

5.14 If the bar magnet in exercise 5.13 is turned around by 180°, 
where will the new null points be located? 

5.15 A short bar magnet of magnetic moment $5.25 \times 10^{-2}$ J T$^{-1}$ is
placed with its axis perpendicular to the earth’s field direction. At what
distance from the centre of the magnet is the resultant field inclined
at 45° with the earth’s field on (a) its normal bisector and (b) its axis?
The magnitude of the earth’s field at the place is given to be 0.42 G.
Ignore the length of the magnet in comparison to the distances
involved.


Additional Exercises 

5.16 Answer the following questions: 

(a) Why does a paramagnetic sample display greater magnetisation 
(for the same magnetising field) when cooled? 

(b) Why is diamagnetism, in contrast, almost independent of 
temperature? 

(c) If a toroid uses bismuth for its core, will the field in the core be 
(slightly) greater or (slightly) less than when the core is empty? 

(d) Is the permeability of a ferromagnetic material independent of the
magnetic field? If not, is it more for lower or higher fields?

(e) Magnetic field lines are always nearly normal to the surface of a 
ferromagnet at every point. (This fact is analogous to the static 
electric field lines being normal to the surface of a conductor at every 
point.) Why? 

(f) Would the maximum possible magnetisation of a paramagnetic 
sample be of the same order of magnitude as the magnetisation of a 
ferromagnet? 

5.17 Answer the following questions: 

(a) Explain qualitatively on the basis of domain picture the 
irreversibility in the magnetisation curve of a ferromagnet. 

(b) The hysteresis loop of a soft iron piece has a much smaller area 
than that of a carbon steel piece. If the material is to go through 
repeated cycles of magnetisation, which piece will dissipate greater 
heat energy? 

(c) ‘A system displaying a hysteresis loop such as a ferromagnet, is a 
device for storing memory?’ Explain the meaning of this statement. 

(d) What kind of ferromagnetic material is used for coating magnetic 
tapes in a cassette player, or for building ‘memory stores’ in a modern 
computer? 

(e) A certain region of space is to be shielded from magnetic fields. 
Suggest a method. 

5.18 A long straight horizontal cable carries a current of 2.5 A in the
direction 10° south of west to 10° north of east. The magnetic
meridian of the place happens to be 10° west of the geographic
meridian. The earth’s magnetic field at the location is 0.33 G, and the
angle of dip is zero. Locate the line of neutral points (ignore the
thickness of the cable). (At neutral points, the magnetic field due to a
current-carrying cable is equal and opposite to the horizontal
component of the earth’s magnetic field.)

5.19 A telephone cable at a place has four long straight horizontal 
wires carrying a current of 1.0 A in the same direction east to west. 
The earth’s magnetic field at the place is 0.39 G, and the angle of dip 
is 35°. The magnetic declination is nearly zero. What are the resultant 
magnetic fields at points 4.0 cm below the cable? 

5.20 A compass needle free to turn in a horizontal plane is placed at 
the centre of a circular coil of 30 turns and radius 12 cm. The coil is in a
vertical plane making an angle of 45° with the magnetic meridian. 
When the current in the coil is 0.35 A, the needle points west to east. 

(a) Determine the horizontal component of the earth’s magnetic field 
at the location. 

(b) The current in the coil is reversed, and the coil is rotated about its 
vertical axis by an angle of 90° in the anticlockwise sense looking 
from above. Predict the direction of the needle. Take the magnetic 
declination at the place to be zero.

5.21 A magnetic dipole is under the influence of two magnetic fields.
The angle between the field directions is 60°, and one of the fields
has a magnitude of $1.2 \times 10^{-2}$ T. If the dipole comes to stable
equilibrium at an angle of 15° with this field, what is the magnitude of
the other field?

5.22 A monoenergetic (18 keV) electron beam initially in the
horizontal direction is subjected to a horizontal magnetic field of 0.04
G normal to the initial direction. Estimate the up or down deflection of
the beam over a distance of 30 cm ($m_e = 9.11 \times 10^{-31}$ kg). [Note:
Data in this exercise are so chosen that the answer will give you an
idea of the effect of earth’s magnetic field on the motion of the
electron beam from the electron gun to the screen in a TV set.]

5.23 A sample of paramagnetic salt contains $2.0 \times 10^{24}$ atomic
dipoles each of dipole moment $1.5 \times 10^{-23}$ J T$^{-1}$. The sample is
placed under a homogeneous magnetic field of 0.64 T, and cooled to
a temperature of 4.2 K. The degree of magnetic saturation achieved
is equal to 15%. What is the total dipole moment of the sample for a
magnetic field of 0.98 T and a temperature of 2.8 K? (Assume Curie’s
law.)

5.24 A Rowland ring of mean radius 15 cm has 3500 turns of wire 
wound on a ferromagnetic core of relative permeability 800. What is 
the magnetic field B in the core for a magnetising current of 1.2 A? 

5.25 The magnetic moment vectors $\mu_s$ and $\mu_l$ associated with the
intrinsic spin angular momentum S and orbital angular momentum l,
respectively, of an electron are predicted by quantum theory (and
verified experimentally to a high accuracy) to be given by:

$$\mu_s = -(e/m)\,\mathbf{S}$$

$$\mu_l = -(e/2m)\,\mathbf{l}$$

Which of these relations is in accordance with the result expected
classically? Outline the derivation of the classical result.




Table of Contents 


Chapter Five 
Magnetism and Matter 

5.1 Introduction 

5.2 The Bar Magnet 

5.2.1 The magnetic field lines 

5.2.2 Bar magnet as an equivalent solenoid 

5.2.3 The dipole in a uniform magnetic field 

5.2.4 The electrostatic analog 

5.3 Magnetism and Gauss’s Law 

5.4 The Earth’s Magnetism 

5.4.1 Magnetic declination and dip 

5.5 Magnetisation and Magnetic Intensity 

5.6 Magnetic Properties of Materials 

5.6.1 Diamagnetism 

5.6.2 Paramagnetism 

5.6.3 Ferromagnetism 

5.7 Permanent Magnets and Electromagnets 




















Chapter Six 
Electromagnetic 
Induction 











6.1 Introduction 

Electricity and magnetism were considered separate and unrelated
phenomena for a long time. In the early decades of the nineteenth
century, experiments on electric current by Oersted, Ampere and a few
others established the fact that electricity and magnetism are
interrelated. They found that moving electric charges produce magnetic fields.
For example, an electric current deflects a magnetic compass needle
placed in its vicinity. This naturally raises questions like: Is the
converse effect possible? Can moving magnets produce electric
currents? Does nature permit such a relation between electricity and
magnetism? The answer is a resounding yes! The experiments of Michael
Faraday in England and Joseph Henry in the USA, conducted around 1830,
demonstrated conclusively that electric currents were induced in closed
coils when subjected to changing magnetic fields. In this chapter, we will
study the phenomena associated with changing magnetic fields and
understand the underlying principles. The phenomenon in which electric
current is generated by varying magnetic fields is appropriately called
electromagnetic induction.

When Faraday first made public his discovery that relative motion
between a bar magnet and a wire loop produced a small current in the
latter, he was asked, “What is the use of it?” His reply was: “What is the
use of a newborn baby?” The phenomenon of electromagnetic induction
is not merely of theoretical or academic interest but also of practical
utility. Imagine a world where there is no electricity - no electric lights, no
trains, no telephones and no personal computers. The pioneering
experiments of Faraday and Henry have led directly to the development
of modern day generators and transformers. Today’s civilisation owes its
progress to a great extent to the discovery of electromagnetic induction.



Joseph Henry [1797 - 1878] American experimental physicist,
professor at Princeton University and first director of the Smithsonian
Institution. He made important improvements in electromagnets by
winding coils of insulated wire around iron pole pieces and invented
an electromagnetic motor and a new, efficient telegraph. He
discovered self-induction and investigated how currents in one circuit
induce currents in another.


6.2 The Experiments of Faraday and Henry 

The discovery and understanding of electromagnetic induction are based 
on a long series of experiments carried out by Faraday and Henry. We 
shall now describe some of these experiments. 

Experiment 6.1 

Figure 6.1 shows a coil $C_1$* connected to a galvanometer G. When the
North-pole of a bar magnet is pushed towards the coil, the pointer in the
galvanometer deflects, indicating the presence of electric current in the
coil. The deflection lasts as long as the bar magnet is in motion. The
galvanometer does not show any deflection when the magnet is held
stationary. When the magnet is pulled away from the coil, the
galvanometer shows deflection in the opposite direction, which indicates
reversal of the current’s direction. Moreover, when the South-pole of the
bar magnet is moved towards or away from the coil, the deflections in the
galvanometer are opposite to those observed with the North-pole for
similar movements. Further, the deflection (and hence current) is found to
be larger when the magnet is pushed towards or pulled away from the
coil faster. Instead, when the bar magnet is held fixed and the coil $C_1$ is
moved towards or away from the magnet, the same effects are observed.
It shows that it is the relative motion between the magnet and the coil that
is responsible for generation (induction) of electric current in the coil.



Figure 6.1 When the bar magnet is pushed towards the coil, the pointer in the galvanometer G 

deflects. 


Experiment 6.2 

In Fig. 6.2 the bar magnet is replaced by a second coil $C_2$ connected to a
battery. The steady current in the coil $C_2$ produces a steady magnetic
field. As coil $C_2$ is moved towards the coil $C_1$, the galvanometer shows a
deflection. This indicates that electric current is induced in coil $C_1$. When
$C_2$ is moved away, the galvanometer shows a deflection again, but this
time in the opposite direction. The deflection lasts as long as coil $C_2$ is in
motion. When the coil $C_2$ is held fixed and $C_1$ is moved, the same effects
are observed. Again, it is the relative motion between the coils that
induces the electric current.

* Wherever the term ‘coil’ or ‘loop’ is used, it is assumed that they are 
made up of conducting material and are prepared using wires which are 
coated with insulating material. 



Figure 6.2 Current is induced in coil $C_1$ due to motion of the current carrying coil $C_2$.

Experiment 6.3 

The above two experiments involved relative motion between a magnet
and a coil and between two coils, respectively. Through another
experiment, Faraday showed that this relative motion is not an absolute
requirement. Figure 6.3 shows two coils $C_1$ and $C_2$ held stationary. Coil
$C_1$ is connected to galvanometer G while the second coil $C_2$ is connected
to a battery through a tapping key K.

It is observed that the galvanometer shows a momentary deflection when
the tapping key K is pressed. The pointer in the galvanometer returns to
zero immediately. If the key is held pressed continuously, there is no
deflection in the galvanometer. When the key is released, a momentary
deflection is observed again, but in the opposite direction. It is also
observed that the deflection increases dramatically when an iron rod is
inserted into the coils along their axis.
Interactive animation on Faraday’s experiments and Lenz’s law:
http://micro.magnet.fsu.edu/electromagnet/java/faraday2/ 


6.3 Magnetic Flux 

Faraday’s great insight lay in discovering a simple mathematical relation
to explain the series of experiments he carried out on electromagnetic
induction. However, before we state and appreciate his laws, we must get
familiar with the notion of magnetic flux, $\Phi_B$. Magnetic flux is defined in
the same way as electric flux is defined in Chapter 1. Magnetic flux
through a plane of area A placed in a uniform magnetic field B (Fig. 6.4)
can be written as

$$\Phi_B = \mathbf{B} \cdot \mathbf{A} = BA \cos\theta \tag{6.1}$$

where $\theta$ is the angle between B and A. The notion of the area as a vector
has been discussed earlier in Chapter 1. Equation (6.1) can be extended
to curved surfaces and nonuniform fields.




Figure 6.4 A plane of surface area A placed in a uniform magnetic field B. 



Figure 6.5 Magnetic field $\mathbf{B}_i$ at the $i$th area element. $d\mathbf{A}_i$ represents the area vector of the $i$th area
element.

If the magnetic field has different magnitudes and directions at various
parts of a surface as shown in Fig. 6.5, then the magnetic flux through
the surface is given by

$$\Phi_B = \mathbf{B}_1 \cdot d\mathbf{A}_1 + \mathbf{B}_2 \cdot d\mathbf{A}_2 + \dots = \sum_{\text{all}} \mathbf{B}_i \cdot d\mathbf{A}_i \tag{6.2}$$

where ‘all’ stands for summation over all the area elements $d\mathbf{A}_i$
comprising the surface and $\mathbf{B}_i$ is the magnetic field at the area element
$d\mathbf{A}_i$. The SI unit of magnetic flux is weber (Wb) or tesla meter squared
(T m$^2$). Magnetic flux is a scalar quantity.
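Equations (6.1) and (6.2) translate directly into a few lines of code. The sketch below is only illustrative: the function names and the sample field and area values are assumptions, and vectors are represented as plain (x, y, z) tuples.

```python
import math


def flux_uniform(b_tesla, area_m2, theta_deg):
    """Eq. (6.1): flux of a uniform field through a plane area, BA cos(theta)."""
    return b_tesla * area_m2 * math.cos(math.radians(theta_deg))


def flux_elements(fields, area_vectors):
    """Eq. (6.2): sum of B_i . dA_i over all area elements."""
    return sum(
        bx * ax + by * ay + bz * az
        for (bx, by, bz), (ax, ay, az) in zip(fields, area_vectors)
    )


# Hypothetical uniform field of 0.5 T through a 2.0 m^2 plane at 60 degrees.
print(flux_uniform(0.5, 2.0, 60.0))

# The same surface split into two 1.0 m^2 elements with normals along z,
# and the field tilted by 60 degrees from z in the x-z plane.
b = (0.5 * math.sin(math.radians(60.0)), 0.0, 0.5 * math.cos(math.radians(60.0)))
print(flux_elements([b, b], [(0.0, 0.0, 1.0), (0.0, 0.0, 1.0)]))
```

For a uniform field the element-wise sum reduces to the single-term expression of Eq. (6.1), so the two printed values agree.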

6.4 Faraday’s Law of Induction 

From the experimental observations, Faraday arrived at a conclusion that 
an emf is induced in a coil when magnetic flux through the coil changes 
with time. Experimental observations discussed in Section 6.2 can be 
explained using this concept. 

The motion of a magnet towards or away from coil $C_1$ in Experiment 6.1
and moving a current-carrying coil $C_2$ towards or away from coil $C_1$ in
Experiment 6.2, change the magnetic flux associated with coil $C_1$. The
change in magnetic flux induces emf in coil $C_1$. It was this induced emf
which caused electric current to flow in coil $C_1$ and through the
galvanometer. A plausible explanation for the observations of Experiment
6.3 is as follows: When the tapping key K is pressed, the current in coil
$C_2$ (and the resulting magnetic field) rises from zero to a maximum value
in a short time. Consequently, the magnetic flux through the neighbouring
coil $C_1$ also increases. It is the change in magnetic flux through coil $C_1$
that produces an induced emf in coil $C_1$. When the key is held pressed,
current in coil $C_2$ is constant. Therefore, there is no change in the
magnetic flux through coil $C_1$ and the current in coil $C_1$ drops to zero.
When the key is released, the current in $C_2$ and the resulting magnetic
field decrease from the maximum value to zero in a short time. This
results in a decrease in magnetic flux through coil $C_1$ and hence again
induces an electric current in coil $C_1$*. The common point in all these
observations is that the time rate of change of magnetic flux through a
circuit induces emf in it. Faraday stated experimental observations in the
form of a law called Faraday’s law of electromagnetic induction. The law
is stated below.

* Note that sensitive electrical instruments in the vicinity of an 
electromagnet can be damaged due to the induced emfs (and the 
resulting currents) when the electromagnet is turned on or off. 



Michael Faraday [1791-1867] Faraday made numerous 
contributions to science, viz., the discovery of electromagnetic 
induction, the laws of electrolysis, benzene, and the fact that the 
plane of polarisation of light is rotated in a magnetic field. He is also credited
with the invention of the electric motor, the electric generator and the 
transformer. He is widely regarded as the greatest experimental 
scientist of the nineteenth century. 


The magnitude of the induced emf in a circuit is equal to the time rate of
change of magnetic flux through the circuit. Mathematically, the induced
emf is given by

$\varepsilon = -\frac{d\Phi_B}{dt}$ (6.3)

The negative sign indicates the direction of $\varepsilon$ and hence the direction of
current in a closed loop. This will be discussed in detail in the next
section.

In the case of a closely wound coil of N turns, the change of flux associated
with each turn is the same. Therefore, the expression for the total
induced emf is given by

$\varepsilon = -N \frac{d\Phi_B}{dt}$ (6.4)


The induced emf can be increased by increasing the number of turns N 
of a closed coil. 
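
Eq. (6.4) can be illustrated with a short Python sketch that estimates the time derivative of the flux numerically. The flux function, the number of turns and the step size are assumptions made for the illustration.

```python
def induced_emf(flux, t, turns=1, dt=1e-6):
    """Estimate the emf -N dPhi/dt at time t using a central difference.

    flux is a function returning the flux per turn (in Wb) at a given time.
    """
    dphi_dt = (flux(t + dt) - flux(t - dt)) / (2.0 * dt)
    return -turns * dphi_dt

def ramp_flux(t):
    # Hypothetical flux per turn rising steadily at 0.02 Wb/s
    return 0.02 * t

emf_one_turn = induced_emf(ramp_flux, 1.0, turns=1)
emf_hundred_turns = induced_emf(ramp_flux, 1.0, turns=100)
print(emf_one_turn, emf_hundred_turns)
```

For the same rate of change of flux, the coil with 100 turns gives an emf 100 times larger; the negative sign reflects the sign convention of Eq. (6.3).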


From Eqs. (6.1) and (6.2), we see that the flux can be varied by changing
any one or more of the terms B, A and $\theta$. In Experiments 6.1 and 6.2 in
Section 6.2, the flux is changed by varying B. The flux can also be
altered by changing the shape of a coil (that is, by shrinking it or
stretching it) in a magnetic field, or rotating a coil in a magnetic field such
that the angle $\theta$ between B and A changes. In these cases too, an emf is
induced in the respective coils.


Example 6.1 Consider Experiment 6.2. (a) What would you do to 
obtain a large deflection of the galvanometer? (b) How would you 
demonstrate the presence of an induced current in the absence of a 
galvanometer? 

Solution 

(a) To obtain a large deflection, one or more of the following steps
can be taken: (i) Use a rod made of soft iron inside the coil $C_2$, (ii)
Connect the coil to a powerful battery, and (iii) Move the arrangement
rapidly towards the test coil $C_1$.

(b) Replace the galvanometer by a small bulb, the kind one finds in a 
small torch light. The relative motion between the two coils will cause 
the bulb to glow and thus demonstrate the presence of an induced 
current. 

In experimental physics one must learn to innovate. Michael Faraday,
who is ranked as one of the best experimentalists ever, was
legendary for his innovative skills.


Example 6.2 A square loop of side 10 cm and resistance 0.5 Ω is
placed vertically in the east-west plane. A uniform magnetic field of 
0.10 T is set up across the plane in the north-east direction. The 
magnetic field is decreased to zero in 0.70 s at a steady rate. 
Determine the magnitudes of induced emf and current during this 
time-interval. 

Solution The angle $\theta$ made by the area vector of the coil with the
magnetic field is 45°. From Eq. (6.1), the initial magnetic flux is

$\Phi = BA \cos\theta = \frac{0.1 \times 10^{-2}}{\sqrt{2}}$ Wb

Final flux, $\Phi_{\min} = 0$

The change in flux is brought about in 0.70 s. From Eq. (6.3), the
magnitude of the induced emf is given by

$\varepsilon = \frac{|\Delta\Phi_B|}{\Delta t} = \frac{|\Phi - 0|}{\Delta t} = \frac{10^{-3}}{\sqrt{2} \times 0.7}$ V $= 1.0$ mV

And the magnitude of the current is

$I = \frac{\varepsilon}{R} = \frac{10^{-3}\ \text{V}}{0.5\ \Omega} = 2$ mA


Note that the earth’s magnetic field also produces a flux through the 
loop. But it is a steady field (which does not change within the time 
span of the experiment) and hence does not induce any emf. 
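
The arithmetic of Example 6.2 can be checked with a few lines of Python. This is only a sketch; the variable names are assumptions, while the numbers are those given in the example.

```python
import math

side = 0.10                    # side of the square loop, m
resistance = 0.5               # ohm
B = 0.10                       # initial field, T
theta = math.radians(45.0)     # angle between field and area vector
dt = 0.70                      # time in which the field drops to zero, s

flux_initial = B * side**2 * math.cos(theta)
flux_final = 0.0

emf = abs(flux_final - flux_initial) / dt
current = emf / resistance
print(f"emf = {emf * 1e3:.2f} mV, current = {current * 1e3:.2f} mA")
```

To two significant figures this gives 1.0 mV and 2.0 mA.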


Example 6.3 

A circular coil of radius 10 cm, 500 turns and resistance 2 Ω is placed
with its plane perpendicular to the horizontal component of the earth’s
magnetic field. It is rotated about its vertical diameter through 180° in
0.25 s. Estimate the magnitudes of the emf and current induced in
the coil. Horizontal component of the earth’s magnetic field at the
place is $3.0 \times 10^{-5}$ T.

Solution 

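Initial flux through the coil,

$\Phi_{B(\text{initial})} = BA \cos\theta$

$= 3.0 \times 10^{-5} \times (\pi \times 10^{-2}) \times \cos 0^\circ$

$= 3\pi \times 10^{-7}$ Wb

Final flux after the rotation,

$\Phi_{B(\text{final})} = 3.0 \times 10^{-5} \times (\pi \times 10^{-2}) \times \cos 180^\circ$

$= -3\pi \times 10^{-7}$ Wb

Therefore, estimated value of the induced emf is,

$\varepsilon = N \frac{\Delta\Phi}{\Delta t}$

$= 500 \times (6\pi \times 10^{-7})/0.25$
$= 3.8 \times 10^{-3}$ V
$I = \varepsilon/R = 1.9 \times 10^{-3}$ A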

Note that the magnitudes of $\varepsilon$ and I are the estimated values. Their
instantaneous values are different and depend upon the speed of
rotation at the particular instant.
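
The difference between the estimated (average) and instantaneous values in Example 6.3 can be illustrated with a short Python sketch. It assumes, purely for illustration, that the coil turns at a uniform angular speed, so that the flux per turn varies as BA cos(ωt); the example itself does not specify how the coil is rotated.

```python
import math

turns = 500
radius = 0.10            # m
resistance = 2.0         # ohm
B_h = 3.0e-5             # horizontal component of the earth's field, T
T = 0.25                 # time taken for the 180 degree rotation, s

area = math.pi * radius**2
flux_max = B_h * area    # flux per turn when the field is along the area vector

# Average values, as estimated in the example
emf_average = turns * (2.0 * flux_max) / T
current_average = emf_average / resistance

# Peak instantaneous emf under the assumed uniform rotation:
# flux = flux_max * cos(omega * t), so emf = N * flux_max * omega * sin(omega * t)
omega = math.pi / T
emf_peak = turns * flux_max * omega

print(f"average emf = {emf_average:.2e} V, average current = {current_average:.2e} A")
print(f"peak emf (uniform rotation assumed) = {emf_peak:.2e} V")
```

Under this assumption the peak emf exceeds the average by a factor of π/2.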


6.5 Lenz’s Law and Conservation of Energy 

In 1834, German physicist Heinrich Friedrich Lenz (1804-1865) deduced 
a rule, known as Lenz’s law which gives the polarity of the induced emf in 
a clear and concise fashion. The statement of the law is: 

The polarity of induced emf is such that it tends to produce a current 
which opposes the change in magnetic flux that produced it. 

The negative sign shown in Eq. (6.3) represents this effect. We can 
understand Lenz’s law by examining Experiment 6.1 in Section 6.2.1. In 
Fig. 6.1, we see that the North-pole of a bar magnet is being pushed 
towards the closed coil. As the North-pole of the bar magnet moves 
towards the coil, the magnetic flux through the coil increases. Hence 
current is induced in the coil in such a direction that it opposes the 
increase in flux. This is possible only if the current in the coil is in a 
counter-clockwise direction with respect to an observer situated on the 




side of the magnet. Note that magnetic moment associated with this 
current has North polarity towards the North-pole of the approaching 
magnet. Similarly, if the North-pole of the magnet is being withdrawn from 
the coil, the magnetic flux through the coil will decrease. To counter this 
decrease in magnetic flux, the induced current in the coil flows in 
clockwise direction and its South-pole faces the receding North-pole of 
the bar magnet. This would result in an attractive force which opposes 
the motion of the magnet and the corresponding decrease in flux. 

What will happen if an open circuit is used in place of the closed loop in 
the above example? In this case too, an emf is induced across the open 
ends of the circuit. The direction of the induced emf can be found using 
Lenz’s law. Consider Figs. 6.6 (a) and (b). They provide an easier way to
understand the direction of induced currents. Note that the curved arrows
shown in the figures indicate the directions of the induced currents.

A little reflection on this matter should convince us of the correctness of
Lenz’s law. Suppose that the induced current was in the direction
opposite to the one depicted in Fig. 6.6(a). In that case, the South-pole
due to the induced current will face the approaching North-pole of the
magnet. The bar magnet will then be attracted towards the coil at an ever
increasing acceleration. A gentle push on the magnet will initiate the
process and its velocity and kinetic energy will continuously increase
without expending any energy. If this can happen, one could construct a
perpetual-motion machine by a suitable arrangement. This violates the
law of conservation of energy and hence cannot happen.


Figure 6.6 Illustration of Lenz’s law. 


Now consider the correct case shown in Fig. 6.6(a). In this situation, the 
bar magnet experiences a repulsive force due to the induced current. 
Therefore, a person has to do work in moving the magnet. Where does 
the energy spent by the person go? This energy is dissipated by Joule 
heating produced by the induced current. 


Example 6.4 

Figure 6.7 shows planar loops of different shapes moving out of or 
into a region of a magnetic field which is directed normal to the plane 
of the loop away from the reader. Determine the direction of induced 
current in each loop using Lenz’s law. 




Figure 6.7 

Solution 

(i) The magnetic flux through the rectangular loop abcd increases,
due to the motion of the loop into the region of magnetic field. The
induced current must flow along the path bcdab so that it opposes the
increasing flux.

(ii) Due to the outward motion, magnetic flux through the triangular
loop abc decreases due to which the induced current flows along
bacb, so as to oppose the change in flux.

(iii) As the magnetic flux decreases due to motion of the irregular
shaped loop abcd out of the region of magnetic field, the induced
current flows along cdabc, so as to oppose change in flux.

Note that there is no induced current as long as the loops are
completely inside or outside the region of the magnetic field.








Example 6.5 

(a) A closed loop is held stationary in the magnetic field between the 
north and south poles of two permanent magnets held fixed. Can we 
hope to generate current in the loop by using very strong magnets? 

(b) A closed loop moves normal to the constant electric field between 
the plates of a large capacitor. Is a current induced in the loop 

(i) when it is wholly inside the region between the capacitor plates (ii) 
when it is partially outside the plates of the capacitor? The electric 
field is normal to the plane of the loop. 

(c) A rectangular loop and a circular loop are moving out of a uniform 
magnetic field region (Fig. 6.8) to a field-free region with a constant 
velocity v. In which loop do you expect the induced emf to be 
constant during the passage out of the field region? The field is 
normal to the loops. 

(d) Predict the polarity of the capacitor in the situation described by
Fig. 6.9.

Figure 6.9

Solution 

(a) No. However strong the magnet may be, current can be induced 
only by changing the magnetic flux through the loop. 

(b) No current is induced in either case. Current cannot be induced
by changing the electric flux.

(c) The induced emf is expected to be constant only in the case of the 
rectangular loop. In the case of circular loop, the rate of change of 
area of the loop during its passage out of the field region is not 
constant, hence induced emf will vary accordingly. 

(d) The polarity of plate ‘A’ will be positive with respect to plate ‘B’ in 
the capacitor. 


6.6 Motional Electromotive Force 

Let us consider a straight conductor moving in a uniform and
time-independent magnetic field. Figure 6.10 shows a rectangular conductor
PQRS in which the conductor PQ is free to move. The rod PQ is moved 
towards the left with a constant velocity v as shown in the figure. Assume 
that there is no loss of energy due to friction. PQRS forms a closed circuit 
enclosing an area that changes as PQ moves. It is placed in a uniform 
magnetic field B which is perpendicular to the plane of this system. If the 















length RQ = x and RS = l, the magnetic flux $\Phi_B$ enclosed by the loop
PQRS will be

$\Phi_B = Blx$

Since x is changing with time, the rate of change of flux $\Phi_B$ will induce an
emf given by:

$\varepsilon = -\frac{d\Phi_B}{dt} = -\frac{d}{dt}(Blx) = -Bl\frac{dx}{dt} = Blv$ (6.5)

where we have used dx/dt = -v, v being the speed of the conductor PQ.
The induced emf Blv is called motional emf. Thus, we are able to produce
induced emf by moving a conductor instead of varying the magnetic field,
that is, by changing the magnetic flux enclosed by the circuit.

Figure 6.10 The arm PQ is moved to the left side, thus decreasing the area of the rectangular
loop. This movement induces a current I as shown.
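
The result in Eq. (6.5) can be checked numerically. The sketch below compares Blv with a finite-difference estimate of the rate of change of the flux Blx when x decreases at speed v. The function names and the sample values are assumptions made for the illustration.

```python
def motional_emf(B, l, v):
    """Motional emf Blv (in V) for a rod of length l moving at speed v in field B."""
    return B * l * v

def emf_from_flux(B, l, x0, v, t, dt=1e-6):
    """Estimate -d(Phi)/dt for Phi = B*l*x, with x = x0 - v*t (arm moving inwards)."""
    def flux(time):
        return B * l * (x0 - v * time)
    return -(flux(t + dt) - flux(t - dt)) / (2.0 * dt)

# Illustrative values only (assumed, not from the text)
direct = motional_emf(B=0.5, l=0.2, v=3.0)
from_flux = emf_from_flux(B=0.5, l=0.2, x0=1.0, v=3.0, t=0.1)
print(direct, from_flux)
```

Both routes give the same emf, about 0.3 V for these assumed values.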




















PHYSICS 

Interactive animation on motional emf: 

http://ngsir.netfirms.com/englishhtm/lnduction.htm 

http://webphysics.davidson.edu/physlet_resources/bu_semester2/index 

It is also possible to explain the motional emf expression in Eq. (6.5) by 
invoking the Lorentz force acting on the free charge carriers of conductor 
PQ. Consider any arbitrary charge q in the conductor PQ. When the rod 
moves with speed v, the charge will also be moving with speed v in the 
magnetic field B. The Lorentz force on this charge is qvB in magnitude, 
and its direction is towards Q. All charges experience the same force, in 
magnitude and direction, irrespective of their position in the rod PQ. 

The work done in moving the charge from P to Q is,

$W = qvBl$

Since emf is the work done per unit charge,

$\varepsilon = \frac{W}{q} = Blv$

This equation gives emf induced across the rod PQ and is identical to Eq. 
(6.5). We stress that our presentation is not wholly rigorous. But it does 
help us to understand the basis of Faraday’s law when the conductor is 
moving in a uniform and time-independent magnetic field. 

On the other hand, it is not obvious how an emf is induced when a 
conductor is stationary and the magnetic field is changing - a fact which 




Faraday verified by numerous experiments. In the case of a stationary 
conductor, the force on its charges is given by 

$\mathbf{F} = q(\mathbf{E} + \mathbf{v} \times \mathbf{B}) = q\mathbf{E}$ (6.6)

since v = 0. Thus, any force on the charge must arise from the electric 
field term E alone. Therefore, to explain the existence of induced emf or 
induced current, we must assume that a time-varying magnetic field 
generates an electric field. However, we hasten to add that electric fields 
produced by static electric charges have properties different from those 
produced by time-varying magnetic fields. In Chapter 4, we learnt that 
charges in motion (current) can exert force/torque on a stationary 
magnet. Conversely, a bar magnet in motion (or more generally, a 
changing magnetic field) can exert a force on the stationary charge. This 
is the fundamental significance of Faraday’s discovery. Electricity and
magnetism are related. 


Example 6.6 A metallic rod of 1 m length is rotated with a frequency 
of 50 rev/s, with one end hinged at the centre and the other end at 
the circumference of a circular metallic ring of radius 1 m, about an 
axis passing through the centre and perpendicular to the plane of the 
ring (Fig. 6.11). A constant and uniform magnetic field of 1 T parallel 
to the axis is present everywhere. What is the emf between the 
centre and the metallic ring?

Figure 6.11

Solution 

Method I 

As the rod is rotated, free electrons in the rod move towards the outer
end due to Lorentz force and get distributed over the ring. Thus, the
resulting separation of charges produces an emf across the ends of
the rod. At a certain value of emf, there is no more flow of electrons
and a steady state is reached. Using Eq. (6.5), the magnitude of the
emf generated across a length dr of the rod as it moves at right
angles to the magnetic field is given by

$d\varepsilon = Bv\,dr$. Hence,

$\varepsilon = \int d\varepsilon = \int_0^R Bv\,dr = \int_0^R B\omega r\,dr = \frac{B\omega R^2}{2}$

Note that we have used $v = \omega r$. This gives

$\varepsilon = \frac{1}{2} \times 1.0 \times 2\pi \times 50 \times (1^2)$

$= 157$ V


Method II 


To calculate the emf, we can imagine a closed loop OPQ in which
points O and P are connected with a resistor R and OQ is the rotating
rod. The potential difference across the resistor is then equal to the
induced emf and equals B × (rate of change of area of loop). If $\theta$ is
the angle between the rod and the radius of the circle at P at time t,
the area of the sector OPQ is given by

$\pi R^2 \times \frac{\theta}{2\pi} = \frac{1}{2} R^2 \theta$

where R is the radius of the circle. Hence, the induced emf is

$\varepsilon = B \times \frac{d}{dt}\left[\frac{1}{2} R^2 \theta\right] = \frac{1}{2} B R^2 \frac{d\theta}{dt} = \frac{B\omega R^2}{2}$

[Note: $\frac{d\theta}{dt} = \omega = 2\pi\nu$]

This expression is identical to the expression obtained by Method I
and we get the same value of $\varepsilon$.
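
Method I of Example 6.6 adds up the small emfs generated across short pieces of the rod. A minimal Python sketch of that sum is given below and compared with the closed-form result; the function name and the number of slices are assumptions made for the illustration.

```python
import math

def rotating_rod_emf(B, freq_rev_per_s, length, steps=1000):
    """Sum d(emf) = B * (omega * r) * dr over thin slices of the rod."""
    omega = 2.0 * math.pi * freq_rev_per_s
    dr = length / steps
    total = 0.0
    for k in range(steps):
        r = (k + 0.5) * dr          # midpoint of the k-th slice
        total += B * omega * r * dr
    return total

emf_numeric = rotating_rod_emf(B=1.0, freq_rev_per_s=50.0, length=1.0)
emf_closed_form = 0.5 * 1.0 * (2.0 * math.pi * 50.0) * 1.0**2
print(emf_numeric, emf_closed_form)
```

Both values come out close to 157 V, in agreement with the two methods of the example.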


Example 6.7 

A wheel with 10 metallic spokes each 0.5 m long is rotated with a
speed of 120 rev/min in a plane normal to the horizontal component
of earth’s magnetic field $H_E$ at a place. If $H_E$ = 0.4 G at the place,
what is the induced emf between the axle and the rim of the wheel?

Note that 1 G = $10^{-4}$ T.

Solution 











Induced emf = $(1/2)\,\omega B R^2$
$= (1/2) \times 4\pi \times 0.4 \times 10^{-4} \times (0.5)^2$
$= 6.28 \times 10^{-5}$ V

The number of spokes is immaterial because the emfs across the 
spokes are in parallel. 


6.7 Energy Consideration: A Quantitative Study 

In Section 6.5, we discussed qualitatively that Lenz’s law is consistent 
with the law of conservation of energy. Now we shall explore this aspect 
further with a concrete example. 

Let r be the resistance of movable arm PQ of the rectangular conductor
shown in Fig. 6.10. We assume that the remaining arms QR, RS and SP
have negligible resistances compared to r. Thus, the overall resistance of
the rectangular loop is r and this does not change as PQ is moved. The
current I in the loop is,

$I = \frac{\varepsilon}{r} = \frac{Blv}{r}$ (6.7)

On account of the presence of the magnetic field, there will be a force on
the arm PQ. This force, $I(\mathbf{l} \times \mathbf{B})$, is directed outwards in the direction
opposite to the velocity of the rod. The magnitude of this force is,

$F = IlB = \frac{B^2 l^2 v}{r}$

where we have used Eq. (6.7). Note that this force arises due to drift
velocity of charges (responsible for current) along the rod and the
consequent Lorentz force acting on them. As the arm PQ is
being pushed with a constant speed v, the power required to do this is,

$P = Fv = \frac{B^2 l^2 v^2}{r}$ (6.8)

The agent that does this work is mechanical. Where does this
mechanical energy go? The answer is: it is dissipated as Joule heat, and
is given by

$P_J = I^2 r = \left(\frac{Blv}{r}\right)^2 r = \frac{B^2 l^2 v^2}{r}$

which is identical to Eq. (6.8).

Thus, mechanical energy which was needed to move the arm PQ is
converted into electrical energy (the induced emf) and then to thermal
energy.
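
The energy balance described above can be checked with a short Python sketch. The function name and the sample values of B, l, v and r are assumptions made for the illustration.

```python
def sliding_rod_power(B, l, v, r):
    """Return (mechanical power, Joule power) for the arm PQ moving at speed v."""
    emf = B * l * v                  # motional emf
    current = emf / r                # current in the loop
    force = current * l * B          # magnitude of the force on the arm
    mechanical_power = force * v     # rate at which the external agent does work
    joule_power = current**2 * r     # rate of heat dissipation in the arm
    return mechanical_power, joule_power

# Illustrative values only (assumed, not from the text)
p_mech, p_joule = sliding_rod_power(B=0.5, l=0.2, v=3.0, r=1.5)
print(p_mech, p_joule)
```

The two powers agree (about 0.06 W for these assumed values), as expected from conservation of energy.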

There is an interesting relationship between the charge flow through the
circuit and the change in the magnetic flux. From Faraday’s law, we have
learnt that the magnitude of the induced emf is,

$|\varepsilon| = \frac{\Delta\Phi_B}{\Delta t}$

However,

$|\varepsilon| = Ir = \frac{\Delta Q}{\Delta t} r$

Thus,

$\Delta Q = \frac{\Delta\Phi_B}{r}$

Example 6.8 Refer to Fig. 6.12(a). The arm PQ of the rectangular
conductor is moved from x = 0, outwards. The uniform magnetic field 
is perpendicular to the plane and extends from x = 0 to x = b and is 
zero for x > b. Only the arm PQ possesses substantial resistance r. 
Consider the situation when the arm PQ is pulled outwards from x = 0 
to x = 2b, and is then moved back to x = 0 with constant speed v. 
Obtain expressions for the flux, the induced emf, the force necessary 
to pull the arm and the power dissipated as Joule heat. Sketch the 
variation of these quantities with distance.

Figure 6.12

Solution Let us first consider the forward motion from x = 0 to x = 2b.
The flux $\Phi_B$ linked with the circuit SPQR is

$\Phi_B = Blx$,   $0 \le x < b$

$= Blb$,   $b \le x < 2b$

The induced emf is,

$\varepsilon = -\frac{d\Phi_B}{dt}$

$= -Blv$,   $0 \le x < b$

$= 0$,   $b \le x < 2b$

When the induced emf is non-zero, the current I is (in magnitude)

$I = \frac{Blv}{r}$

The force required to keep the arm PQ in constant motion is IlB. Its
direction is to the left. In magnitude

$F = \frac{B^2 l^2 v}{r}$,   $0 \le x < b$

$= 0$,   $b \le x < 2b$

The Joule heating loss is

$P_J = I^2 r$

$= \frac{B^2 l^2 v^2}{r}$,   $0 \le x < b$

$= 0$,   $b \le x < 2b$

One obtains similar expressions for the inward motion from x = 2b to 
x = 0. One can appreciate the whole process by examining the sketch 
of various quantities displayed in Fig. 6.12(b). 
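
The piecewise behaviour found in Example 6.8 for the outward motion can be tabulated with a short Python sketch. The function name and the sample values are assumptions made for the illustration; the magnitudes follow the relations for the flux, emf, force and Joule heating stated in the solution.

```python
def loop_quantities(x, B, l, b, v, r):
    """Return (flux, emf magnitude, force, Joule power) for the arm at position x
    during the outward motion, 0 <= x <= 2b. The field exists only for x < b."""
    if x < 0.0 or x > 2.0 * b:
        raise ValueError("x must lie between 0 and 2b")
    if x < b:
        flux = B * l * x
        emf = B * l * v
    else:
        flux = B * l * b             # flux stays constant outside the field region
        emf = 0.0
    current = emf / r
    force = current * l * B
    power = current**2 * r
    return flux, emf, force, power

# Illustrative values only (assumed, not from the text)
inside = loop_quantities(1.0, B=1.0, l=0.5, b=2.0, v=4.0, r=2.0)
outside = loop_quantities(3.0, B=1.0, l=0.5, b=2.0, v=4.0, r=2.0)
print(inside)
print(outside)
```

While the arm is inside the field region all four quantities are non-zero; once it moves beyond x = b the flux stays constant and the emf, force and heating vanish.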


6.8 Eddy Currents 

So far we have studied the electric currents induced in well defined paths 
in conductors like circular loops. Even when bulk pieces of conductors 
are subjected to changing magnetic flux, induced currents are produced 
in them. However, their flow patterns resemble swirling eddies in water. 
This effect was discovered by physicist Foucault (1819-1868) and these 
currents are called eddy currents. 

Consider the apparatus shown in Fig. 6.13. A copper plate is allowed to 
swing like a simple pendulum between the pole pieces of a strong 
magnet. It is found that the motion is damped and in a little while the 
plate comes to a halt in the magnetic field. We can explain this 
phenomenon on the basis of electromagnetic induction. Magnetic flux 
associated with the plate keeps on changing as the plate moves in and 
out of the region between magnetic poles. The flux change induces eddy 
currents in the plate. Directions of eddy currents are opposite when the 



plate swings into the region between the poles and when it swings out of 
the region. 



Figure 6.13 Eddy currents are generated in the copper plate, while entering and leaving the region 

of magnetic field. 

If rectangular slots are made in the copper plate as shown in Fig. 6.14, 
area available to the flow of eddy currents is less. Thus, the pendulum 
plate with holes or slots reduces electromagnetic damping and the plate 
swings more freely. Note that magnetic moments of the induced currents 
(which oppose the motion) depend upon the area enclosed by the 
currents (recall equation m = IA in Chapter 4). 

This fact is helpful in reducing eddy currents in the metallic cores of 
transformers, electric motors and other such devices in which a coil is to 
be wound over metallic core. Eddy currents are undesirable since they 
heat up the core and dissipate electrical energy in the form of heat. Eddy 
currents are minimised by using laminations of metal to make a metal 






core. The laminations are separated by an insulating material like 
lacquer. The plane of the laminations must be arranged parallel to the 
magnetic field, so that they cut across the eddy current paths. This 
arrangement reduces the strength of the eddy currents. Since the 
dissipation of electrical energy into heat depends on the square of the 
strength of electric current, heat loss is substantially reduced. 



Figure 6.14 Cutting slots in the copper plate reduces the effect of eddy currents.

Eddy currents are used to advantage in certain applications like: 

(i) Magnetic braking in trains: Strong electromagnets are situated above
the rails in some electrically powered trains. When the electromagnets
are activated, the eddy currents induced in the rails oppose the motion of
the train. As there are no mechanical linkages, the braking effect is
smooth.


(ii) Electromagnetic damping: Certain galvanometers have a fixed core 
made of nonmagnetic metallic material. When the coil oscillates, the eddy 




currents generated in the core oppose the motion and bring the coil to 
rest quickly. 

(iii) Induction furnace: Induction furnace can be used to produce high 
temperatures and can be utilised to prepare alloys, by melting the 
constituent metals. A high frequency alternating current is passed 
through a coil which surrounds the metals to be melted. The eddy
currents generated in the metals produce high temperatures sufficient to
melt them.

(iv) Electric power meters: The shiny metal disc in the electric power 
meter (analogue type) rotates due to the eddy currents. Electric currents 
are induced in the disc by magnetic fields produced by sinusoidally 
varying currents in a coil. 

You can observe the rotating shiny disc in the power meter of your 
house. 


Electromagnetic damping 

Take two hollow thin cylindrical pipes of equal internal diameters 
made of aluminium and PVC, respectively. Fix them vertically with 
clamps on retort stands. Take a small cylindrical magnet having
diameter slightly smaller than the inner diameter of the pipes and 
drop it through each pipe in such a way that the magnet does not 
touch the sides of the pipes during its fall. You will observe that the 
magnet dropped through the PVC pipe takes the same time to come 
out of the pipe as it would take when dropped through the same 
height without the pipe. Note the time it takes to come out of the pipe 
in each case. You will see that the magnet takes much longer time in 
the case of aluminium pipe. Why is it so? It is due to the eddy 



currents that are generated in the aluminium pipe which oppose the 
change in magnetic flux, i.e., the motion of the magnet. The retarding 
force due to the eddy currents inhibits the motion of the magnet. 
Such phenomena are referred to as electromagnetic damping. Note 
that eddy currents are not generated in PVC pipe as its material is an 
insulator whereas aluminium is a conductor. 


6.9 Inductance 

An electric current can be induced in a coil by flux change produced by
another coil in its vicinity or flux change produced by the same coil.
These two situations are described separately in the next two
sub-sections. However, in both the cases, the flux through a coil is
proportional to the current. That is, $\Phi_B \propto I$.

Further, if the geometry of the coil does not vary with time then,

$\frac{d\Phi_B}{dt} \propto \frac{dI}{dt}$

For a closely wound coil of N turns, the same magnetic flux is linked with
all the turns. When the flux $\Phi_B$ through the coil changes, each turn
contributes to the induced emf. Therefore, a term called flux linkage is
used which is equal to $N\Phi_B$ for a closely wound coil and in such a case

$N\Phi_B \propto I$

The constant of proportionality, in this relation, is called inductance. We
shall see that inductance depends only on the geometry of the coil and
intrinsic material properties. This aspect is akin to capacitance which for
a parallel plate capacitor depends on the plate area and plate separation
(geometry) and the dielectric constant K of the intervening medium
(intrinsic material property).

Inductance is a scalar quantity. It has the dimensions of [M L$^2$ T$^{-2}$ A$^{-2}$]
given by the dimensions of flux divided by the dimensions of current. The
SI unit of inductance is henry and is denoted by H. It is named in honour
of Joseph Henry who discovered electromagnetic induction in USA,
independently of Faraday in England.

6.9.1 Mutual inductance 

Consider Fig. 6.15 which shows two long co-axial solenoids each of
length l. We denote the radius of the inner solenoid $S_1$ by $r_1$ and the
number of turns per unit length by $n_1$. The corresponding quantities for
the outer solenoid $S_2$ are $r_2$ and $n_2$, respectively. Let $N_1$ and $N_2$ be the
total number of turns of coils $S_1$ and $S_2$, respectively.

When a current $I_2$ is set up through $S_2$, it in turn sets up a magnetic flux
through $S_1$. Let us denote it by $\Phi_1$. The corresponding flux linkage with
solenoid $S_1$ is

$N_1 \Phi_1 = M_{12} I_2$ (6.9)

$M_{12}$ is called the mutual inductance of solenoid $S_1$ with respect to
solenoid $S_2$. It is also referred to as the coefficient of mutual induction.

For these simple co-axial solenoids it is possible to calculate $M_{12}$. The
magnetic field due to the current $I_2$ in $S_2$ is $\mu_0 n_2 I_2$. The resulting flux
linkage with coil $S_1$ is,

$N_1 \Phi_1 = (n_1 l)(\pi r_1^2)(\mu_0 n_2 I_2) = \mu_0 n_1 n_2 \pi r_1^2 l I_2$ (6.10)

where $n_1 l$ is the total number of turns in solenoid $S_1$. Thus, from
Eq. (6.9) and Eq. (6.10),

$M_{12} = \mu_0 n_1 n_2 \pi r_1^2 l$ (6.11)

Note that we neglected the edge effects and considered the magnetic
field $\mu_0 n_2 I_2$ to be uniform throughout the length and width of the solenoid
$S_2$. This is a good approximation keeping in mind that the solenoid is
long, implying $l \gg r_2$.

We now consider the reverse case. A current $I_1$ is passed through the
solenoid $S_1$ and the flux linkage with coil $S_2$ is,

$N_2 \Phi_2 = M_{21} I_1$ (6.12)

$M_{21}$ is called the mutual inductance of solenoid $S_2$ with respect to
solenoid $S_1$.

The flux due to the current $I_1$ in $S_1$ can be assumed to be confined solely
inside $S_1$ since the solenoids are very long. Thus, flux linkage with
solenoid $S_2$ is

$N_2 \Phi_2 = (n_2 l)(\pi r_1^2)(\mu_0 n_1 I_1)$

where $n_2 l$ is the total number of turns of $S_2$. From Eq. (6.12),

$M_{21} = \mu_0 n_1 n_2 \pi r_1^2 l$ (6.13)

Using Eq. (6.11) and Eq. (6.13), we get

$M_{12} = M_{21} = M$ (say) (6.14)

Figure 6.15 Two long co-axial solenoids of same length l.

We have demonstrated this equality for long co-axial solenoids. However,
the relation is far more general. Note that if the inner solenoid was much
shorter than (and placed well inside) the outer solenoid, then we could
still have calculated the flux linkage $N_1 \Phi_1$ because the inner solenoid is
effectively immersed in a uniform magnetic field due to the outer
solenoid. In this case, the calculation of $M_{12}$ would be easy. However, it
would be extremely difficult to calculate the flux linkage with the outer
solenoid as the magnetic field due to the inner solenoid would vary
across the length as well as cross section of the outer solenoid.
Therefore, the calculation of $M_{21}$ would also be extremely difficult in this
case. The equality $M_{12} = M_{21}$ is very useful in such situations.

We explained the above example with air as the medium within the
solenoids. Instead, if a medium of relative permeability $\mu_r$ had been
present, the mutual inductance would be

$M = \mu_r \mu_0 n_1 n_2 \pi r_1^2 l$

It is also important to know that the mutual inductance of a pair of coils, 
solenoids, etc., depends on their separation as well as their relative 
orientation. 
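
The expression for the mutual inductance of two long co-axial solenoids can be evaluated with a short Python sketch. The function name and the sample dimensions are assumptions made for the illustration, and the value used for the permeability of free space is the usual approximate one.

```python
import math

MU_0 = 4.0 * math.pi * 1e-7   # permeability of free space, T m/A (approximate)

def mutual_inductance(n1, n2, r_inner, length, mu_r=1.0):
    """Mutual inductance (in H) of two long co-axial solenoids of equal length.

    n1, n2 are turns per unit length; r_inner is the radius of the inner solenoid.
    """
    return mu_r * MU_0 * n1 * n2 * math.pi * r_inner**2 * length

# Illustrative values only (assumed, not from the text)
M_air = mutual_inductance(n1=2000.0, n2=1000.0, r_inner=0.02, length=0.5)
M_core = mutual_inductance(n1=2000.0, n2=1000.0, r_inner=0.02, length=0.5, mu_r=500.0)
print(M_air, M_core)
```

Note that only the inner radius enters the result, and that interchanging n1 and n2 leaves M unchanged, in line with the equality of the two mutual inductances.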


Example 6.9 Two concentric circular coils, one of small radius $r_1$ and
the other of large radius $r_2$, such that $r_1 \ll r_2$, are placed co-axially
with centres coinciding. Obtain the mutual inductance of the
arrangement.

Solution Let a current $I_2$ flow through the outer circular coil. The field
at the centre of the coil is $B_2 = \mu_0 I_2 / 2r_2$. Since the other
co-axially placed coil has a very small radius, $B_2$ may be considered
constant over its cross-sectional area. Hence,

$\Phi_1 = \pi r_1^2 B_2 = \frac{\mu_0 \pi r_1^2}{2 r_2} I_2 = M_{12} I_2$

Thus,

$M_{12} = \frac{\mu_0 \pi r_1^2}{2 r_2}$

From Eq. (6.14)

$M_{12} = M_{21} = \frac{\mu_0 \pi r_1^2}{2 r_2}$

Note that we calculated $M_{12}$ from an approximate value of $\Phi_1$,
assuming the magnetic field $B_2$ to be uniform over the area $\pi r_1^2$.
However, we can accept this value because $r_1 \ll r_2$.

Now, let us recollect Experiment 6.3 in Section 6.2. In that
experiment, emf is induced in coil $C_1$ whenever there was any change
in current through coil $C_2$. Let $\Phi_1$ be the flux through coil $C_1$ (say of
$N_1$ turns) when current in coil $C_2$ is $I_2$.

Then, from Eq. (6.9), we have

$N_1 \Phi_1 = M I_2$

For currents varying with time,

$\frac{d(N_1 \Phi_1)}{dt} = \frac{d(M I_2)}{dt}$

Since induced emf in coil $C_1$ is given by

$\varepsilon_1 = -\frac{d(N_1 \Phi_1)}{dt}$

we get,

$\varepsilon_1 = -M \frac{dI_2}{dt}$

It shows that varying current in a coil can induce emf in a
neighbouring coil. The magnitude of the induced emf depends upon
the rate of change of current and mutual inductance of the two coils.


6.9.2 Self-inductance 

In the previous sub-section, we considered the flux in one solenoid due to 
the current in the other. It is also possible that emf is induced in a single 
isolated coil due to change of flux through the coil by means of varying 
the current through the same coil. This phenomenon is called self- 
induction. In this case, flux linkage through a coil of N turns is 
proportional to the current through the coil and is expressed as 

N0 fl oc i 

N&u = LI (6.15) 

where constant of proportionality L is called self-inductance of the coil. It 
is also called the coefficient of self-induction of the coil. When the current 
is varied, the flux linked with the coil also changes and an emf is induced 
in the coil. Using Eq. (6.15), the induced emf is given by 

$$\varepsilon = -\frac{d(N\Phi_B)}{dt}$$

$$\varepsilon = -L \frac{dI}{dt} \tag{6.16}$$

Thus, the self-induced emf always opposes any change (increase or 
decrease) of current in the coil. 






It is possible to calculate the self-inductance for circuits with simple
geometries. Let us calculate the self-inductance of a long solenoid of
cross-sectional area A and length $l$, having n turns per unit length. The
magnetic field due to a current I flowing in the solenoid is $B = \mu_0 n I$
(neglecting edge effects, as before). The total flux linked with the
solenoid is

$$N\Phi_B = (nl)(\mu_0 n I)(A) = \mu_0 n^2 A l I$$

where $nl$ is the total number of turns. Thus, the self-inductance is,

$$L = \frac{N\Phi_B}{I} = \mu_0 n^2 A l \tag{6.17}$$

If we fill the inside of the solenoid with a material of relative permeability
$\mu_r$ (for example soft iron, which has a high value of relative permeability),
then,

$$L = \mu_r \mu_0 n^2 A l \tag{6.18}$$

The self-inductance of the coil depends on its geometry and on the 
permeability of the medium. 
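
Equations (6.17) and (6.18) can be tried out numerically. The Python sketch below is only an illustration: the function name, the solenoid dimensions (1000 turns per metre, area 10 cm², length 0.5 m) and the relative permeability of 1000 used for the filled core are all assumed values, not data from the text.

```python
import math

MU_0 = 4 * math.pi * 1e-7  # permeability of free space in T m / A


def solenoid_self_inductance(n, area, length, mu_r=1.0):
    """Self-inductance (H) of a long solenoid, L = mu_r * mu_0 * n^2 * A * l.

    n      : turns per unit length (1/m)
    area   : cross-sectional area (m^2)
    length : length of the solenoid (m)
    mu_r   : relative permeability of the core (1 for air)
    Edge effects are neglected, as in the derivation.
    """
    return mu_r * MU_0 * n**2 * area * length


# Assumed, illustrative solenoid
L_air = solenoid_self_inductance(n=1000, area=1e-3, length=0.5)
L_core = solenoid_self_inductance(n=1000, area=1e-3, length=0.5, mu_r=1000)
print(f"air core: {L_air:.3e} H, filled core: {L_core:.3e} H")
```

Because L depends on $n^2$, doubling the winding density in this sketch raises the inductance by a factor of four.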

The self-induced emf is also called the back emf as it opposes any
change in the current in a circuit. Physically, the self-inductance plays the
role of inertia. It is the electromagnetic analogue of mass in mechanics.
So, work needs to be done against the back emf ($\varepsilon$) in establishing
the current. This work done is stored as magnetic potential energy. For the
current I at an instant in a circuit, the rate of work done is

$$\frac{dW}{dt} = |\varepsilon| I$$

If we ignore the resistive losses and consider only inductive effect, then
using Eq. (6.16),

$$\frac{dW}{dt} = L I \frac{dI}{dt}$$

Total amount of work done in establishing the current I is

$$W = \int dW = \int_0^I L I \, dI$$

Thus, the energy required to build up the current I is,

$$W = \frac{1}{2} L I^2 \tag{6.19}$$

This expression reminds us of $mv^2/2$ for the (mechanical) kinetic energy
of a particle of mass m, and shows that L is analogous to m (i.e., L is
electrical inertia and opposes growth and decay of current in the circuit).
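
The integration leading to Eq. (6.19) can be checked numerically by adding up the small amounts of work $L I \, dI$ as the current grows. The sketch below is a minimal illustration; the function name and the values L = 0.5 H and I = 2.0 A are assumed, not taken from the text.

```python
def work_to_build_current(L, I_final, steps=10_000):
    """Numerically sum dW = L * I * dI from I = 0 to I = I_final.

    Uses the midpoint rule: the current in each small step is taken at
    the middle of that step.
    """
    dI = I_final / steps
    W = 0.0
    for k in range(steps):
        I_mid = (k + 0.5) * dI  # current at the middle of step k
        W += L * I_mid * dI
    return W


# Assumed, illustrative values
L, I = 0.5, 2.0
W_numeric = work_to_build_current(L, I)
W_formula = 0.5 * L * I**2  # Eq. (6.19)
print(f"numerical sum: {W_numeric:.6f} J, formula: {W_formula:.6f} J")
```

The two values should agree closely, since the integrand is linear in I and the midpoint rule handles it essentially exactly.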

Consider the general case of currents flowing simultaneously in two 
nearby coils. The flux linked with one coil will be the sum of two fluxes 
which exist independently. Equation (6.9) would be modified into 

$$N_1 \Phi_1 = M_{11} I_1 + M_{12} I_2$$

where $M_{11}$ represents inductance due to the same coil.

Therefore, using Faraday’s law,

$$\varepsilon_1 = -M_{11} \frac{dI_1}{dt} - M_{12} \frac{dI_2}{dt}$$

$M_{11}$ is the self-inductance and is written as $L_1$. Therefore,

$$\varepsilon_1 = -L_1 \frac{dI_1}{dt} - M_{12} \frac{dI_2}{dt}$$

Example 6.10 (a) Obtain the expression for the magnetic energy
stored in a solenoid in terms of magnetic field B, area A and length $l$
of the solenoid. (b) How does this magnetic energy compare with the
electrostatic energy stored in a capacitor?

Solution 


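(a) From Eq. (6.19), the magnetic energy is

$$\begin{aligned}
U_B &= \frac{1}{2} L I^2 \\
    &= \frac{1}{2} L \left( \frac{B}{\mu_0 n} \right)^2 \quad (\text{since } B = \mu_0 n I \text{ for a solenoid}) \\
    &= \frac{1}{2} (\mu_0 n^2 A l) \left( \frac{B}{\mu_0 n} \right)^2 \quad [\text{from Eq. (6.17)}] \\
    &= \frac{1}{2\mu_0} B^2 A l
\end{aligned}$$

(b) The magnetic energy per unit volume is,

$$u_B = \frac{U_B}{V} \quad (\text{where } V \text{ is volume that contains flux})$$

$$= \frac{U_B}{A l} = \frac{B^2}{2\mu_0} \tag{6.20}$$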


We have already obtained the relation for the electrostatic energy 
stored per unit volume in a parallel plate capacitor (refer to Chapter 2, 
Eq. 2.77), 

$$u_E = \frac{1}{2} \varepsilon_0 E^2 \tag{2.77}$$

In both the cases energy is proportional to the square of the field 
strength. Equations (6.20) and (2.77) have been derived for special 
cases: a solenoid and a parallel plate capacitor, respectively. But they 
are general and valid for any region of space in which a magnetic 
field or/and an electric field exist. 


6.10 AC Generator 

The phenomenon of electromagnetic induction has been technologically
exploited in many ways. An exceptionally important application is the
generation of alternating currents (ac). The modern ac generator with a
typical output capacity of 100 MW is a highly evolved machine. In this
section, we shall describe the basic principles behind this machine. The
Yugoslav inventor Nikola Tesla is credited with the development of the
machine. As was pointed out in Section 6.3, one method to induce an
emf or current in a loop is through a change in the loop’s orientation or a
change in its effective area. As the coil rotates in a magnetic field B, the
effective area of the loop (the face perpendicular to the field) is
$A \cos\theta$, where $\theta$ is the angle between A and B. This method of
producing a flux change is the principle of operation of a simple ac
generator. An ac generator converts mechanical energy into electrical energy.

The basic elements of an ac generator are shown in Fig. 6.16. It consists 
of a coil mounted on a rotor shaft. The axis of rotation of the coil is 
perpendicular to the direction of the magnetic field. The coil (called 
armature) is mechanically rotated in the uniform magnetic field by some 
external means. The rotation of the coil causes the magnetic flux through 
it to change, so an emf is induced in the coil. The ends of the coil are 
connected to an external circuit by means of slip rings and brushes. 

When the coil is rotated with a constant angular speed $\omega$, the angle
$\theta$ between the magnetic field vector B and the area vector A of the coil
at any instant t is $\theta = \omega t$ (assuming $\theta = 0°$ at t = 0). As a
result, the effective area of the coil exposed to the magnetic field lines
changes with time, and from Eq. (6.1), the flux at any time t is

$$\Phi_B = BA \cos\theta = BA \cos\omega t$$

From Faraday’s law, the induced emf for the rotating coil of N turns is
then,

$$\varepsilon = -N \frac{d\Phi_B}{dt} = -NBA \frac{d}{dt}(\cos\omega t)$$

Thus, the instantaneous value of the emf is

$$\varepsilon = NBA\omega \sin\omega t \tag{6.21}$$

where $NBA\omega$ is the maximum value of the emf, which occurs when
$\sin\omega t = \pm 1$. If we denote $NBA\omega$ as $\varepsilon_0$, then

$$\varepsilon = \varepsilon_0 \sin\omega t \tag{6.22}$$

Figure 6.16 AC Generator

Since the value of the sine function varies between +1 and -1, the sign, or
polarity of the emf changes with time. Note from Fig. 6.17 that the emf
has its extremum value when $\theta = 90°$ or $\theta = 270°$, as the change
of flux is greatest at these points.

The direction of the current changes periodically and therefore the
current is called alternating current (ac). Since $\omega = 2\pi\nu$, Eq. (6.22)
can be written as

$$\varepsilon = \varepsilon_0 \sin 2\pi\nu t \tag{6.23}$$

where $\nu$ is the frequency of revolution of the generator’s coil.























Note that Eqs. (6.22) and (6.23) give the instantaneous value of the emf
and $\varepsilon$ varies between $+\varepsilon_0$ and $-\varepsilon_0$
periodically. We shall learn how to determine the time-averaged value for the
alternating voltage and current in the next chapter.
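
To see how Eq. (6.21) behaves over one revolution, the Python sketch below samples the emf at eight equally spaced instants. It is only an illustration: the function name and the generator parameters (N = 50, B = 0.2 T, A = 0.05 m², frequency 50 Hz) are assumed values, not data from the text.

```python
import math


def generator_emf(N, B, A, omega, t):
    """Instantaneous emf (V) of a rotating coil: N * B * A * omega * sin(omega * t)."""
    return N * B * A * omega * math.sin(omega * t)


# Assumed, illustrative generator parameters
N, B, A = 50, 0.2, 0.05      # turns, tesla, square metres
nu = 50.0                    # revolutions per second
omega = 2 * math.pi * nu     # angular speed in rad/s
T = 1 / nu                   # period of one revolution in seconds

# Sample the emf at theta = 0, 45, 90, ..., 360 degrees
samples = [(k * 45, generator_emf(N, B, A, omega, k * T / 8)) for k in range(9)]
for theta_deg, emf in samples:
    print(f"theta = {theta_deg:3d} deg   emf = {emf:9.3f} V")
```

In this sketch the emf is largest in magnitude at 90° and 270° and passes through zero at 0°, 180° and 360°, which is the behaviour described for Fig. 6.17.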

Figure 6.17 An alternating emf is generated by a loop of wire rotating in a magnetic field.


In commercial generators, the mechanical energy required for rotation of 
the armature is provided by water falling from a height, for example, from 
dams. These are called hydro-electric generators. Alternatively, water is 
heated to produce steam using coal or other sources. The steam at high 
pressure produces the rotation of the armature. These are called thermal 
generators. Instead of coal, if a nuclear fuel is used, we get nuclear 
power generators. Modern day generators produce electric power as high 
as 500 MW, i.e., one can light up 5 million 100 W bulbs! In most 
generators, the coils are held stationary and it is the electromagnets 
which are rotated. The frequency of rotation is 50 Hz in India. In certain 
countries such as USA, it is 60 Hz. 

























Example 6.11 Kamla pedals a stationary bicycle. The pedals of the
bicycle are attached to a 100 turn coil of area 0.10 m². The coil
rotates at half a revolution per second and it is placed in a uniform
magnetic field of 0.01 T perpendicular to the axis of rotation of the
coil. What is the maximum voltage generated in the coil?

Solution Here $\nu$ = 0.5 Hz; N = 100, A = 0.1 m² and B = 0.01 T.
Employing Eq. (6.21)

$$\varepsilon_0 = NBA(2\pi\nu)$$

$$= 100 \times 0.01 \times 0.1 \times 2 \times 3.14 \times 0.5 = 0.314\ \text{V}$$

The maximum voltage is 0.314 V.

We urge you to explore such alternative possibilities for power 
generation. 
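
The arithmetic of Example 6.11 can be reproduced with a few lines of Python. The function name below is an assumption made for this sketch; the numbers are those given in the example.

```python
import math


def peak_emf(N, B, A, nu):
    """Maximum emf (V) of a coil of N turns and area A (m^2) rotating at
    nu revolutions per second in a uniform field B (T): N * B * A * 2 * pi * nu."""
    return N * B * A * 2 * math.pi * nu


# Values from Example 6.11
e0 = peak_emf(N=100, B=0.01, A=0.10, nu=0.5)
print(f"maximum voltage = {e0:.3f} V")
```

Changing `nu` in this sketch shows that pedalling twice as fast would double the maximum voltage.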


Migration of birds 

The migratory pattern of birds is one of the mysteries in the field of 
biology, and indeed all of science. For example, every winter birds 
from Siberia fly unerringly to water spots in the Indian subcontinent. 
There has been a suggestion that electromagnetic induction may 
provide a clue to these migratory patterns. The earth’s magnetic field 
has existed throughout evolutionary history. It would be of great 
benefit to migratory birds to use this field to determine the direction. 
As far as we know birds contain no ferromagnetic material. So 






electromagnetic induction seems to be the only reasonable
mechanism to determine direction. Consider the optimal case where
the magnetic field B, the velocity of the bird v, and two relevant points
of its anatomy separated by a distance $l$, all three are mutually
perpendicular. From the formula for motional emf, Eq. (6.5),

$$\varepsilon = Blv$$

Taking $B = 4 \times 10^{-5}$ T, $l$ = 2 cm, and v = 10 m/s, we obtain

$$\varepsilon = 4 \times 10^{-5} \times 2 \times 10^{-2} \times 10\ \text{V} = 8 \times 10^{-6}\ \text{V} = 8\ \mu\text{V}$$

This extremely small potential difference suggests that our hypothesis
is of doubtful validity. Certain kinds of fish are able to detect small
potential differences. However, in these fish, special cells have been
identified which detect small voltage differences. In birds no such
cells have been identified. Thus, the migration patterns of birds
continue to remain a mystery.


Summary 

1. The magnetic flux through a surface of area A placed in a uniform
magnetic field B is defined as,

$$\Phi_B = \mathbf{B} \cdot \mathbf{A} = BA \cos\theta$$

where $\theta$ is the angle between B and A.

2. Faraday’s laws of induction imply that the emf induced in a coil of
N turns is directly related to the rate of change of flux through it,

$$\varepsilon = -N \frac{d\Phi_B}{dt}$$

Here $\Phi_B$ is the flux linked with one turn of the coil. If the circuit is
closed, a current $I = \varepsilon/R$ is set up in it, where R is the
resistance of the circuit.

3. Lenz’s law states that the polarity of the induced emf is such that it
tends to produce a current which opposes the change in magnetic
flux that produces it. The negative sign in the expression for
Faraday’s law indicates this fact.

4. When a metal rod of length $l$ is placed normal to a uniform
magnetic field B and moved with a velocity v perpendicular to the
field, the induced emf (called motional emf) across its ends is

$$\varepsilon = Blv$$

5. Changing magnetic fields can set up current loops in nearby metal
(any conductor) bodies. They dissipate electrical energy as heat.
Such currents are eddy currents.

6. Inductance is the ratio of the flux-linkage to current. It is equal to
$N\Phi/I$.

7. A changing current in a coil (coil 2) can induce an emf in a nearby
coil (coil 1). This relation is given by,

$$\varepsilon_1 = -M_{12} \frac{dI_2}{dt}$$

The quantity $M_{12}$ is called mutual inductance of coil 1 with respect to
coil 2. One can similarly define $M_{21}$. There exists a general equality,

$$M_{12} = M_{21}$$

8. When a current in a coil changes, it induces a back emf in the
same coil. The self-induced emf is given by,

$$\varepsilon = -L \frac{dI}{dt}$$

L is the self-inductance of the coil. It is a measure of the inertia of the
coil against the change of current through it.

9. The self-inductance of a long solenoid, the core of which consists
of a magnetic material of relative permeability $\mu_r$, is given by

$$L = \mu_r \mu_0 n^2 A l$$

where A is the area of cross-section of the solenoid, $l$ its length and n
the number of turns per unit length.

10. In an ac generator, mechanical energy is converted to electrical
energy by virtue of electromagnetic induction. If a coil of N turns and
area A is rotated at $\nu$ revolutions per second in a uniform magnetic
field B, then the motional emf produced is

$$\varepsilon = NBA(2\pi\nu) \sin(2\pi\nu t)$$

where we have assumed that at time t = 0 s, the coil is perpendicular
to the field.


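Quantity | Symbol | Units | Dimensions | Equations
Magnetic Flux | $\Phi_B$ | Wb (weber) | $[M L^2 T^{-2} A^{-1}]$ | $\Phi_B = \mathbf{B} \cdot \mathbf{A}$
EMF | $\varepsilon$ | V (volt) | $[M L^2 T^{-3} A^{-1}]$ | $\varepsilon = -d(N\Phi_B)/dt$
Mutual Inductance | M | H (henry) | $[M L^2 T^{-2} A^{-2}]$ | $\varepsilon_1 = -M_{12}\,(dI_2/dt)$
Self Inductance | L | H (henry) | $[M L^2 T^{-2} A^{-2}]$ | $\varepsilon = -L\,(dI/dt)$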


Points to Ponder 








1. Electricity and magnetism are intimately related. In the early part of 
the nineteenth century, the experiments of Oersted, Ampere and 
others established that moving charges (currents) produce a 
magnetic field. Somewhat later, around 1830, the experiments of 
Faraday and Henry demonstrated that a moving magnet can induce 
electric current. 

2. In a closed circuit, electric currents are induced so as to oppose 
the changing magnetic flux. It is as per the law of conservation of 
energy. However, in case of an open circuit, an emf is induced across 
its ends. How is it related to the flux change? 

3. The motional emf discussed in Section 6.5 can be argued
independently from Faraday’s law using the Lorentz force on moving
charges. However, even if the charges are stationary [and the
$q(\mathbf{v} \times \mathbf{B})$ term of the Lorentz force is not operative],
an emf is nevertheless induced in the presence of a time-varying magnetic
field. Thus, moving charges in a static field and static charges in a
time-varying field seem to be symmetric situations for Faraday’s law. This
gives a tantalising hint on the relevance of the principle of relativity for
Faraday’s law.

4. The motion of a copper plate is damped when it is allowed to
oscillate between the magnetic pole-pieces. How is the damping
force produced by the eddy currents?

Exercises 

6.1 Predict the direction of induced current in the situations described 
by the following Figs. 6.18(a) to (f). 
Figure 6.18

6.2 Use Lenz’s law to determine the direction of induced current in 
the situations described by Fig. 6.19: 

(a) A wire of irregular shape turning into a circular shape; 

(b) A circular loop being deformed into a narrow straight wire. 

6.3 A long solenoid with 15 turns per cm has a small loop of area 2.0 cm²
placed inside the solenoid normal to its axis. If the current carried
by the solenoid changes steadily from 2.0 A to 4.0 A in 0.1 s, what is
the induced emf in the loop while the current is changing?

6.4 A rectangular wire loop of sides 8 cm and 2 cm with a small cut is
moving out of a region of uniform magnetic field of magnitude 0.3 T
directed normal to the loop. What is the emf developed across the cut
if the velocity of the loop is 1 cm s⁻¹ in a direction normal to the (a)
longer side, (b) shorter side of the loop? For how long does the
induced voltage last in each case?

6.5 A 1.0 m long metallic rod is rotated with an angular frequency of
400 rad s⁻¹ about an axis normal to the rod passing through its one
end. The other end of the rod is in contact with a circular metallic ring.
A constant and uniform magnetic field of 0.5 T parallel to the axis
exists everywhere. Calculate the emf developed between the centre
and the ring.

6.6 A circular coil of radius 8.0 cm and 20 turns is rotated about its
vertical diameter with an angular speed of 50 rad s⁻¹ in a uniform
horizontal magnetic field of magnitude 3.0 × 10⁻² T. Obtain the
maximum and average emf induced in the coil. If the coil forms a
closed loop of resistance 10 Ω, calculate the maximum value of
current in the coil. Calculate the average power loss due to Joule
heating. Where does this power come from?

6.7 A horizontal straight wire 10 m long extending from east to west is
falling with a speed of 5.0 m s⁻¹, at right angles to the horizontal
component of the earth’s magnetic field, 0.30 × 10⁻⁴ Wb m⁻².

(a) What is the instantaneous value of the emf induced in the wire? 

(b) What is the direction of the emf? 

(c) Which end of the wire is at the higher electrical potential? 

6.8 Current in a circuit falls from 5.0 A to 0.0 A in 0.1 s. If an average
emf of 200 V is induced, give an estimate of the self-inductance of the
circuit.

6.9 A pair of adjacent coils has a mutual inductance of 1.5 H. If the 
current in one coil changes from 0 to 20 A in 0.5 s, what is the 
change of flux linkage with the other coil? 

6.10 A jet plane is travelling towards west at a speed of 1800 km/h.
What is the voltage difference developed between the ends of the
wing having a span of 25 m, if the Earth’s magnetic field at the
location has a magnitude of 5 × 10⁻⁴ T and the dip angle is 30°?

Additional Exercises

6.11 Suppose the loop in Exercise 6.4 is stationary but the current
feeding the electromagnet that produces the magnetic field is
gradually reduced so that the field decreases from its initial value of
0.3 T at the rate of 0.02 T s⁻¹. If the cut is joined and the loop has a
resistance of 1.6 Ω, how much power is dissipated by the loop as
heat? What is the source of this power?

6.12 A square loop of side 12 cm with its sides parallel to X and Y
axes is moved with a velocity of 8 cm s⁻¹ in the positive x-direction in
an environment containing a magnetic field in the positive z-direction.
The field is neither uniform in space nor constant in time. It has a
gradient of 10⁻³ T cm⁻¹ along the negative x-direction (that is, it
increases by 10⁻³ T cm⁻¹ as one moves in the negative x-direction),
and it is decreasing in time at the rate of 10⁻³ T s⁻¹. Determine the
direction and magnitude of the induced current in the loop if its
resistance is 4.50 mΩ.

6.13 It is desired to measure the magnitude of field between the
poles of a powerful loudspeaker magnet. A small flat search coil of
area 2 cm² with 25 closely wound turns is positioned normal to the
field direction, and then quickly snatched out of the field region.
(Equivalently, one can give it a quick 90° turn to bring its plane parallel
to the field direction.) The total charge flown in the coil (measured by
a ballistic galvanometer connected to the coil) is 7.5 mC. The combined
resistance of the coil and the galvanometer is 0.50 Ω. Estimate the field
strength of the magnet.

6.14 Figure 6.20 shows a metal rod PQ resting on the smooth rails
AB and positioned between the poles of a permanent magnet. The
rails, the rod, and the magnetic field are in three mutually perpendicular
directions. A galvanometer G connects the rails through a switch K.
Length of the rod = 15 cm, B = 0.50 T, resistance of the closed loop
containing the rod = 9.0 mΩ. Assume the field to be uniform.

(a) Suppose K is open and the rod is moved with a speed of 12 cm s⁻¹
in the direction shown. Give the polarity and magnitude of the
induced emf.

Figure 6.20

(b) Is there an excess charge built up at the ends of the rods when K
is open? What if K is closed?

(c) With K open and the rod moving uniformly, there is no net force on
the electrons in the rod PQ even though they do experience magnetic
force due to the motion of the rod. Explain.

(d) What is the retarding force on the rod when K is closed?

(e) How much power is required (by an external agent) to keep the
rod moving at the same speed (= 12 cm s⁻¹) when K is closed? How
much power is required when K is open?

(f) How much power is dissipated as heat in the closed circuit? What 
is the source of this power? 

(g) What is the induced emf in the moving rod if the magnetic field is 
parallel to the rails instead of being perpendicular? 

6.15 An air-cored solenoid with length 30 cm, area of cross-section
25 cm² and number of turns 500, carries a current of 2.5 A. The
current is suddenly switched off in a brief time of 10⁻³ s. How much is
the average back emf induced across the ends of the open switch in
the circuit? Ignore the variation in magnetic field near the ends of the
solenoid.


6.16 (a) Obtain an expression for the mutual inductance between a 
long straight wire and a square loop of side a as shown in Fig. 6.21. 


(b) Now assume that the straight wire carries a current of 50 A and 
the loop is moved to the right with a constant velocity, v = 10 m/s. 
Calculate the induced emf in the loop at the instant when x = 0.2 m. 
Take a = 0.1 m and assume that the loop has a large resistance. 
Figure 6.21

6.17 A line charge $\lambda$ per unit length is lodged uniformly onto the rim of
a wheel of mass M and radius R. The wheel has light non-conducting
spokes and is free to rotate without friction about its axis (Fig. 6.22).
A uniform magnetic field extends over a circular region within the rim.
It is given by,

$$\mathbf{B} = -B_0 \hat{\mathbf{k}} \quad (r < a;\ a < R)$$

$$= 0 \quad (\text{otherwise})$$

What is the angular velocity of the wheel after the field is suddenly
switched off?












Figure 6.22 
Chapter Seven


Alternating Current 


7.1 Introduction 

We have so far considered direct current (dc) sources and circuits with dc 
sources. These currents do not change direction with time. But voltages 
and currents that vary with time are very common. The electric mains 
supply in our homes and offices is a voltage that varies like a sine 
function with time. Such a voltage is called alternating voltage (ac 
voltage) and the current driven by it in a circuit is called the alternating 
current (ac current)*. Today, most of the electrical devices we use require 
ac voltage. This is mainly because most of the electrical energy sold by 
power companies is transmitted and distributed as alternating current. 
The main reason for preferring use of ac voltage over dc voltage is that 
ac voltages can be easily and efficiently converted from one voltage to 
the other by means of transformers. Further, electrical energy can also be 
transmitted economically over long distances. AC circuits exhibit 
characteristics which are exploited in many devices of daily use. For 
example, whenever we tune our radio to a favourite station, we are taking 
advantage of a special property of ac circuits - one of many that you will 
study in this chapter. 

* The phrases ac voltage and ac current are contradictory and redundant,
respectively, since they mean, literally, alternating current voltage and
alternating current current. Still, the abbreviation ac to designate an
electrical quantity displaying simple harmonic time dependence has
become so universally accepted that we follow others in its use. Further,
voltage, another phrase commonly used, means potential difference
between two points.


7.2 AC Voltage Applied to a Resistor 

Figure 7.1 shows a resistor connected to a source $\varepsilon$ of ac voltage.
The symbol for an ac source in a circuit diagram is a circle enclosing a
sine wave (~). We consider a source which produces sinusoidally varying
potential difference across its terminals. Let this potential difference,
also called ac voltage, be given by

$$v = v_m \sin\omega t \tag{7.1}$$

where $v_m$ is the amplitude of the oscillating potential difference and
$\omega$ is its angular frequency.

Nikola Tesla (1856 - 1943) Serbian-American scientist, inventor and
genius. He conceived the idea of the rotating magnetic field, which is
the basis of practically all alternating current machinery, and which
helped usher in the age of electric power. He also invented among
other things the induction motor, the polyphase system of ac power,
and the high frequency induction coil (the Tesla coil) used in radio
and television sets and other electronic equipment. The SI unit of
magnetic field is named in his honour.



To find the value of current through the resistor, we apply Kirchhoff’s loop
rule $\sum \varepsilon(t) = 0$ (refer to Section 3.13), to the circuit shown in
Fig. 7.1 to get

$$v_m \sin\omega t = iR$$

or

$$i = \frac{v_m}{R} \sin\omega t$$

Since R is a constant, we can write this equation as

$$i = i_m \sin\omega t \tag{7.2}$$

where the current amplitude $i_m$ is given by

$$i_m = \frac{v_m}{R} \tag{7.3}$$

Figure 7.1 AC voltage applied to a resistor.

Equation (7.3) is Ohm’s law, which for resistors, works equally well for 
both ac and dc voltages. The voltage across a pure resistor and the 
current through it, given by Eqs. (7.1) and (7.2) are plotted as a function 
of time in Fig. 7.2. Note, in particular that both v and i reach zero, 
minimum and maximum values at the same time. Clearly, the voltage and 
current are in phase with each other. 









Figure 7.2 In a pure resistor, the voltage and current are in phase. The minima, zero and maxima occur at the same respective times.

We see that, like the applied voltage, the current varies sinusoidally and
has corresponding positive and negative values during each cycle. Thus,
the sum of the instantaneous current values over one complete cycle is
zero, and the average current is zero. The fact that the average current is
zero, however, does not mean that the average power consumed is zero
and that there is no dissipation of electrical energy. As you know, Joule
heating is given by $i^2 R$ and depends on $i^2$ (which is always positive
whether i is positive or negative) and not on i. Thus, there is Joule
heating and dissipation of electrical energy when an ac current passes
through a resistor.

The instantaneous power dissipated in the resistor is

$$p = i^2 R = i_m^2 R \sin^2\omega t \tag{7.4}$$

The average value of p over a cycle is*

$$\bar{p} = \langle i^2 R \rangle = \langle i_m^2 R \sin^2\omega t \rangle \tag*{[7.5(a)]}$$

where the bar over a letter (here, p) denotes its average value and
$\langle \cdot \rangle$ denotes taking average of the quantity inside the
bracket. Since $i_m^2$ and R are constants,

$$\bar{p} = i_m^2 R \langle \sin^2\omega t \rangle \tag*{[7.5(b)]}$$

Using the trigonometric identity, $\sin^2\omega t = \frac{1}{2}(1 - \cos 2\omega t)$,
we have $\langle \sin^2\omega t \rangle = \frac{1}{2}(1 - \langle \cos 2\omega t \rangle)$
and since $\langle \cos 2\omega t \rangle = 0$**, we have,

$$\langle \sin^2\omega t \rangle = \frac{1}{2}$$

Thus,

$$\bar{p} = \frac{1}{2} i_m^2 R \tag*{[7.5(c)]}$$

George Westinghouse (1846 - 1914) A leading proponent of the
use of alternating current over direct current. Thus, he came into
conflict with Thomas Alva Edison, an advocate of direct current.
Westinghouse was convinced that the technology of alternating
current was the key to the electrical future. He founded the famous
Company named after him and enlisted the services of Nikola Tesla
and other inventors in the development of alternating current motors and
apparatus for the transmission of high tension current, pioneering in
large scale lighting.

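To express ac power in the same form as dc power ($P = I^2 R$), a special
value of current is defined and used. It is called root mean square (rms)
or effective current (Fig. 7.3) and is denoted by $I_{rms}$ or I.
It is defined by

$$I = \sqrt{\overline{i^2}} = \sqrt{\frac{1}{2} i_m^2} = \frac{i_m}{\sqrt{2}} = 0.707\, i_m \tag{7.6}$$

Figure 7.3 The rms current I is related to the peak current $i_m$ by $I = i_m/\sqrt{2} = 0.707\, i_m$.

* The average value of a function F(t) over a period T is given by

$$\langle F(t) \rangle = \frac{1}{T} \int_0^T F(t)\, dt$$

** $$\langle \cos 2\omega t \rangle = \frac{1}{T} \int_0^T \cos 2\omega t\, dt = \frac{1}{T} \left[ \frac{\sin 2\omega t}{2\omega} \right]_0^T = \frac{1}{2\omega T} [\sin 2\omega T - 0] = 0$$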

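In terms of I, the average power, denoted by P is

$$P = \bar{p} = \frac{1}{2} i_m^2 R = I^2 R \tag{7.7}$$

Similarly, we define the rms voltage or effective voltage by

$$V = \frac{v_m}{\sqrt{2}} = 0.707\, v_m \tag{7.8}$$

From Eq. (7.3), we have

$$v_m = i_m R$$

or,

$$\frac{v_m}{\sqrt{2}} = \frac{i_m}{\sqrt{2}} R$$

or,

$$V = IR \tag{7.9}$$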


Equation (7.9) gives the relation between ac current and ac voltage and 
is similar to that in the dc case. This shows the advantage of introducing 
the concept of rms values. In terms of rms values, the equation for power 
[Eq. (7.7)] and relation between current and voltage in ac circuits are 
essentially the same as those for the dc case. 

It is customary to measure and specify rms values for ac quantities. For
example, the household line voltage of 220 V is an rms value with a peak
voltage of

$$v_m = \sqrt{2}\, V = (1.414)(220\ \text{V}) = 311\ \text{V}$$

In fact, the I or rms current is the equivalent dc current that would
produce the same average power loss as the alternating current.
Equation (7.7) can also be written as

$$P = V^2 / R = I V \quad (\text{since } V = I R)$$
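
The averages used in Eqs. (7.5) to (7.8) can be checked numerically. The Python sketch below averages a sinusoidal current and its square over one cycle; the helper name, the 50 Hz frequency and the unit current amplitude are assumed for illustration only.

```python
import math


def mean_over_cycle(f, T, steps=10_000):
    """Average of f(t) over one period T, using a midpoint-rule sum."""
    dt = T / steps
    return sum(f((k + 0.5) * dt) for k in range(steps)) * dt / T


# Assumed, illustrative values
nu = 50.0                  # frequency in Hz
omega = 2 * math.pi * nu   # angular frequency in rad/s
T = 1 / nu                 # period in seconds
i_m = 1.0                  # current amplitude in A

mean_i = mean_over_cycle(lambda t: i_m * math.sin(omega * t), T)
mean_i2 = mean_over_cycle(lambda t: (i_m * math.sin(omega * t)) ** 2, T)
I_rms = math.sqrt(mean_i2)

v_peak = math.sqrt(2) * 220.0  # peak of a 220 V rms supply

print(f"<i> = {mean_i:.2e} A, <i^2> = {mean_i2:.4f} A^2, I_rms = {I_rms:.4f} A")
print(f"peak voltage for 220 V rms = {v_peak:.1f} V")
```

The average current comes out as essentially zero while the average of its square is one half of $i_m^2$, which is why the rms value, rather than the plain average, is the useful measure for power.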


Example 7.1 A light bulb is rated at 100 W for a 220 V supply. Find (a)
the resistance of the bulb; (b) the peak voltage of the source; and (c)
the rms current through the bulb.

Solution

(a) We are given P = 100 W and V = 220 V. The resistance of the
bulb is

$$R = \frac{V^2}{P} = \frac{(220\ \text{V})^2}{100\ \text{W}} = 484\ \Omega$$

(b) The peak voltage of the source is

$$v_m = \sqrt{2}\, V = 311\ \text{V}$$

(c) Since, P = I V

$$I = \frac{P}{V} = \frac{100\ \text{W}}{220\ \text{V}} \approx 0.455\ \text{A}$$
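
The three results of Example 7.1 follow directly from the rms relations and can be reproduced in a short Python sketch. The variable names are chosen here for illustration; the numbers are those of the example.

```python
import math

# Values from Example 7.1 (both are rms ratings)
P = 100.0   # rated power in W
V = 220.0   # rms supply voltage in V

R = V**2 / P              # (a) resistance of the bulb, from P = V^2 / R
v_m = math.sqrt(2) * V    # (b) peak voltage, from V = v_m / sqrt(2)
I = P / V                 # (c) rms current, from P = I V

print(f"R = {R:.0f} ohm, v_m = {v_m:.0f} V, I = {I:.3f} A")
```

As a consistency check, the product I × R in this sketch returns the rms voltage of 220 V, in line with Eq. (7.9).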


7.3 Representation of AC Current and Voltage by Rotating 
Vectors — Phasors 

In the previous section, we learnt that the current through a resistor is in
phase with the ac voltage. But this is not so in the case of an inductor, a
capacitor or a combination of these circuit elements. In order to show
phase relationship between voltage and current in an ac circuit, we use
the notion of phasors. The analysis of an ac circuit is facilitated by the
use of a phasor diagram. A phasor* is a vector which rotates about the
origin with angular speed $\omega$, as shown in Fig. 7.4. The vertical
components of phasors V and I represent the sinusoidally varying
quantities v and i. The magnitudes of phasors V and I represent the
amplitudes or the peak values $v_m$ and $i_m$ of these oscillating quantities.

Figure 7.4(a) shows the voltage and current phasors and their
relationship at time $t_1$ for the case of an ac source connected to a
resistor, i.e., corresponding to the circuit shown in Fig. 7.1. The projection
of voltage and current phasors on vertical axis, i.e., $v_m \sin\omega t$ and
$i_m \sin\omega t$, respectively represent the value of voltage and current at
that instant. As they rotate with frequency $\omega$, curves in Fig. 7.4(b)
are generated.

From Fig. 7.4(a) we see that phasors V and I for the case of a resistor
are in the same direction. This is so for all times. This means that the
phase angle between the voltage and the current is zero.

Figure 7.4 (a) A phasor diagram for the circuit in Fig. 7.1. (b) Graph of v and i versus $\omega t$.


7.4 AC Voltage Applied to an Inductor 

Figure 7.5 shows an ac source connected to an inductor. Usually,
inductors have appreciable resistance in their windings, but we shall
assume that this inductor has negligible resistance. Thus, the circuit is a
purely inductive ac circuit. Let the voltage across the source be
$v = v_m \sin\omega t$. Using the Kirchhoff’s loop rule,
$\sum \varepsilon(t) = 0$, and since there is no resistor in the circuit,

$$v - L \frac{di}{dt} = 0 \tag{7.10}$$

where the second term is the self-induced Faraday emf in the inductor;
and L is the self-inductance of the inductor. The negative sign follows
from Lenz’s law (Chapter 6). Combining Eqs. (7.1) and (7.10), we have

$$\frac{di}{dt} = \frac{v}{L} = \frac{v_m}{L} \sin\omega t \tag{7.11}$$

Figure 7.5 An ac source connected to an inductor.

* Though voltage and current in ac circuit are represented by phasors -
rotating vectors, they are not vectors themselves. They are scalar
quantities. It so happens that the amplitudes and phases of harmonically
varying scalars combine mathematically in the same way as do the
projections of rotating vectors of corresponding magnitudes and
directions. The rotating vectors that represent harmonically varying scalar
quantities are introduced only to provide us with a simple way of adding
these quantities using a rule that we already know.


Equation (7.11) implies that the equation for i(t), the current as a function
of time, must be such that its slope di/dt is a sinusoidally varying quantity,
with the same phase as the source voltage and an amplitude given by
v_m/L. To obtain the current, we integrate di/dt with respect to time:

$\int \frac{di}{dt}\,dt = \frac{v_m}{L} \int \sin(\omega t)\,dt$

and get,

$i = -\frac{v_m}{\omega L} \cos(\omega t) + \text{constant}$

The integration constant has the dimension of current and is time- 
independent. Since the source has an emf which oscillates symmetrically 
about zero, the current it sustains also oscillates symmetrically about 
zero, so that no constant or time-independent component of the current 
exists. Therefore, the integration constant is zero. 

Using

$-\cos(\omega t) = \sin\left(\omega t - \frac{\pi}{2}\right)$,

we have

$i = i_m \sin\left(\omega t - \frac{\pi}{2}\right)$ (7.12)

where $i_m = \frac{v_m}{\omega L}$ is the amplitude of the current. The quantity ωL is
analogous to the resistance and is called inductive reactance, denoted by
X_L:

$X_L = \omega L$ (7.13)

The amplitude of the current is, then

$i_m = \frac{v_m}{X_L}$ (7.14)



Interactive animation on phasor diagrams of ac circuits containing R,
L, C and RLC series circuits:

http://www.animations.physics.unsw.edu.au/jw/AC.html

The dimension of inductive reactance is the same as that of resistance 
and its SI unit is ohm (Ω). The inductive reactance limits the current in a 
purely inductive circuit in the same way as the resistance limits the 
current in a purely resistive circuit. The inductive reactance is directly 
proportional to the inductance and to the frequency of the current. 

A comparison of Eqs. (7.1) and (7.12) for the source voltage and the
current in an inductor shows that the current lags the voltage by π/2 or
one-quarter (1/4) cycle. Figure 7.6(a) shows the voltage and the current
phasors in the present case at instant t_1. The current phasor I is π/2
behind the voltage phasor V. When rotated counterclockwise with
frequency ω, they generate the voltage and current given by Eqs. (7.1) and
(7.12), respectively, as shown in Fig. 7.6(b).




Figure 7.6 (a) A phasor diagram for the circuit in Fig. 7.5. (b) Graph of v and i versus ωt. 


We see that the current reaches its maximum value later than the voltage
by one-fourth of a period, $\frac{T}{4} = \frac{\pi/2}{\omega}$. You have seen that an inductor
has reactance that limits current similar to resistance in a dc circuit.
Does it also consume power like a resistance? Let us try to find out. The
instantaneous power supplied to the inductor is


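$p_L = i v = i_m \sin\left(\omega t - \frac{\pi}{2}\right) \times v_m \sin(\omega t)$

$= -i_m v_m \cos(\omega t) \sin(\omega t)$

$= -\frac{i_m v_m}{2} \sin(2\omega t)$

So, the average power over a complete cycle is

$P_L = \left\langle -\frac{i_m v_m}{2} \sin(2\omega t) \right\rangle = -\frac{i_m v_m}{2} \langle \sin(2\omega t) \rangle = 0,$

since the average of sin(2ωt) over a complete cycle is zero. Thus, the
average power supplied to an inductor over one complete cycle is
zero. Figure 7.7 explains it in detail.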
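
The zero average can also be checked numerically. The following is a minimal sketch, not part of the original text: the function name `average_power`, the unit amplitudes and the number of samples are illustrative assumptions. It averages the product v i over one cycle for a current that lags the voltage by π/2 (inductor) and, for comparison, for a current in phase with the voltage (resistor).

```python
import math

def average_power(phase_rad, v_m=1.0, i_m=1.0, samples=10000):
    """Average of v*i over one cycle, with v = v_m sin(x) and
    i = i_m sin(x + phase_rad), sampled uniformly in x = omega*t."""
    total = 0.0
    for k in range(samples):
        x = 2.0 * math.pi * k / samples
        total += v_m * math.sin(x) * i_m * math.sin(x + phase_rad)
    return total / samples

# Inductor: current lags the voltage by pi/2, as in Eq. (7.12).
p_inductor = average_power(-math.pi / 2)
# Resistor: current in phase with the voltage, shown for comparison.
p_resistor = average_power(0.0)

print(f"inductor: {p_inductor:.6f}, resistor: {p_resistor:.6f}")
```

With unit amplitudes, the in-phase case averages to one half while the π/2 case averages to zero within rounding error, which is the content of the result above.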


Example 7.2 A pure inductor of 25.0 mH is connected to a source of 
220 V. Find the inductive reactance and rms current in the circuit if 
the frequency of the source is 50 Hz. 

Solution The inductive reactance,

$X_L = 2\pi\nu L = 2 \times 3.14 \times 50 \times 25 \times 10^{-3}\ \Omega$

$= 7.85\ \Omega$

The rms current in the circuit is

$I = \frac{V}{X_L} = \frac{220\ \text{V}}{7.85\ \Omega} = 28\ \text{A}$
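
The arithmetic of Example 7.2 can be reproduced with a short script. This is only a sketch: the helper names `inductive_reactance` and `rms_current` are hypothetical, chosen here for illustration, and the inputs are the values given in the example.

```python
import math

def inductive_reactance(frequency_hz, inductance_h):
    """X_L = omega * L with omega = 2 * pi * nu (Eq. 7.13), in ohms."""
    return 2.0 * math.pi * frequency_hz * inductance_h

def rms_current(v_rms, reactance_ohm):
    """I = V / X for a purely reactive circuit."""
    return v_rms / reactance_ohm

x_l = inductive_reactance(50.0, 25.0e-3)   # 50 Hz source, 25.0 mH inductor
i_rms = rms_current(220.0, x_l)            # 220 V rms source

print(f"X_L = {x_l:.2f} ohm, I = {i_rms:.1f} A")
```

Doubling the frequency in the call doubles X_L and halves the current, in line with the statement that inductive reactance is proportional to frequency.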



0-1 Current i through the coil entering at A increases from zero to a
maximum value. Flux lines are set up, i.e., the core gets magnetised. With
the polarity shown, voltage and current are both positive. So their
product p is positive. ENERGY IS ABSORBED FROM THE SOURCE. 



1-2 Current in the coil is still positive but is decreasing. The core gets 
demagnetised and the net flux becomes zero at the end of a half cycle. 
The voltage v is negative (since di/dt is negative). The product of voltage 
and current is negative, and ENERGY IS BEING RETURNED TO 
SOURCE. 


One complete cycle of voltage/current, marked at the quarter-cycle
instants 0, 1, 2, 3 and 4. Note that the current lags the voltage.






















2-3 Current i becomes negative i.e., it enters at B and comes out of A. 
Since the direction of current has changed, the polarity of the magnet 
changes. The current and voltage are both negative. So their product p is 
positive. ENERGY IS ABSORBED. 



3-4 Current i decreases and reaches its zero value at 4 when core is 
demagnetised and flux is zero. The voltage is positive but the current is 
negative. The power is, therefore, negative. ENERGY ABSORBED 
DURING THE CYCLE 2-3 IS RETURNED TO THE SOURCE. 


Figure 7.7 Magnetisation and demagnetisation of an inductor. 


7.5 AC Voltage Applied to a Capacitor 




















































Figure 7.8 shows an ac source ε generating ac voltage v = v_m sin ωt
connected to a capacitor only, a purely capacitive ac circuit. 

When a capacitor is connected to a voltage source in a dc circuit, current 
will flow for the short time required to charge the capacitor. As charge 
accumulates on the capacitor plates, the voltage across them increases, 
opposing the current. That is, a capacitor in a dc circuit will limit or 
oppose the current as it charges. When the capacitor is fully charged, the 
current in the circuit falls to zero. 

When the capacitor is connected to an ac source, as in Fig. 7.8, it limits 
or regulates the current, but does not completely prevent the flow of 
charge. The capacitor is alternately charged and discharged as the 
current reverses each half cycle. Let q be the charge on the capacitor at 
any time t. The instantaneous voltage v across the capacitor is 


$v = \frac{q}{C}$ (7.15)

From Kirchhoff's loop rule, the voltage across the source and the
capacitor are equal,

$v_m \sin \omega t = \frac{q}{C}$








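To find the current, we use the relation $i = \frac{dq}{dt}$:

$i = \frac{d}{dt}\left(v_m C \sin \omega t\right) = \omega C v_m \cos(\omega t)$

Using the relation $\cos(\omega t) = \sin\left(\omega t + \frac{\pi}{2}\right)$, we have

$i = i_m \sin\left(\omega t + \frac{\pi}{2}\right)$ (7.16)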

where the amplitude of the oscillating current is i_m = ωCv_m. We can
rewrite it as

$i_m = \frac{v_m}{(1/\omega C)}$

Comparing it to i_m = v_m/R for a purely resistive circuit, we find that
(1/ωC) plays the role of resistance. It is called capacitive reactance and is
denoted by X_C,

$X_C = 1/\omega C$ (7.17)

so that the amplitude of the current is

$i_m = \frac{v_m}{X_C}$ (7.18)




FIGURE 7.9 (a) A phasor diagram for the circuit in Fig. 7.8. (b) Graph of v and i versus ωt. 

The dimension of capacitive reactance is the same as that of resistance 
and its SI unit is ohm (Ω). The capacitive reactance limits the amplitude 
of the current in a purely capacitive circuit in the same way as the 
resistance limits the current in a purely resistive circuit. But it is inversely 
proportional to the frequency and the capacitance. 

A comparison of Eq. (7.16) with the equation of source voltage, Eq. (7.1)
shows that the current is π/2 ahead of voltage.

Figure 7.9(a) shows the phasor diagram at an instant t_1. Here the current
phasor I is π/2 ahead of the voltage phasor V as they rotate
counterclockwise. Figure 7.9(b) shows the variation of voltage and
current with time. We see that the current reaches its maximum value
earlier than the voltage by one-fourth of a period.

The instantaneous power supplied to the capacitor is 

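$p_C = i v = i_m \cos(\omega t)\, v_m \sin(\omega t)$

$= i_m v_m \cos(\omega t) \sin(\omega t)$

$= \frac{i_m v_m}{2} \sin(2\omega t)$ (7.19)

So, as in the case of an inductor, the average power

$P_C = \left\langle \frac{i_m v_m}{2} \sin(2\omega t) \right\rangle = \frac{i_m v_m}{2} \langle \sin(2\omega t) \rangle = 0$

since ⟨sin(2ωt)⟩ = 0 over a complete cycle. Figure 7.10 explains it in
detail.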


Thus, we see that in the case of an inductor, the current lags the voltage
by π/2 and in the case of a capacitor, the current leads the voltage by
π/2.




0-1 The current i flows as shown and from the maximum at 0, reaches a
zero value at 1. The plate A is charged to positive polarity while negative
charge q builds up in B reaching a maximum at 1 until the current
becomes zero. The voltage v_C = q/C is in phase with q and reaches
maximum value at 1. Current and voltage are both positive. So p = v_C i is
positive.

ENERGY IS ABSORBED FROM THE SOURCE DURING THIS QUARTER CYCLE AS THE
CAPACITOR IS CHARGED.






2-3 As i continues to flow from A to B, the capacitor is charged to 
reversed polarity i.e., the plate B acquires positive and A acquires 
negative charge. Both the current and the voltage are negative. Their 
product p is positive. The capacitor ABSORBS ENERGY during this 1/4 
cycle. 

One complete cycle of voltage/current, with successive quarter cycles of
charge, discharge, charge and discharge. Note that the current leads the
voltage.







1-2 The current i reverses its direction. The accumulated charge is
depleted, i.e., the capacitor is discharged during this quarter cycle. The
voltage gets reduced but is still positive. The current is negative. Their
product, the power, is negative.

THE ENERGY ABSORBED DURING THE 1/4 CYCLE 0-1 IS RETURNED
DURING THIS QUARTER.




3-4 The current i reverses its direction at 3 and flows from B to A. The
accumulated charge is depleted and the magnitude of the voltage v_C is
reduced. v_C becomes zero at 4 when the capacitor is fully discharged.
The power is negative. ENERGY ABSORBED DURING 2-3 IS
RETURNED TO THE SOURCE. NET ENERGY ABSORBED IS ZERO.

Figure 7.10 Charging and discharging of a capacitor. 


Example 7.3 A lamp is connected in series with a capacitor. Predict
your observations for dc and ac connections. What happens in each
case if the capacitance of the capacitor is reduced?

Solution When a dc source is connected to a capacitor, the capacitor
gets charged and after charging no current flows in the circuit and the
lamp will not glow. There will be no change even if C is reduced. With
an ac source, the capacitor offers capacitive reactance (1/ωC) and the
current flows in the circuit. Consequently, the lamp will shine.
Reducing C will increase reactance and the lamp will shine less
brightly than before.


Example 7.4 A 15.0 μF capacitor is connected to a 220 V, 50 Hz 
source. Find the capacitive reactance and the current (rms and peak) 
in the circuit. If the frequency is doubled, what happens to the 
capacitive reactance and the current? 

Solution The capacitive reactance is 

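$X_C = \frac{1}{2\pi\nu C} = \frac{1}{2\pi (50\ \text{Hz})(15.0 \times 10^{-6}\ \text{F})} = 212\ \Omega$

The rms current is

$I = \frac{V}{X_C} = \frac{220\ \text{V}}{212\ \Omega} = 1.04\ \text{A}$

The peak current is

$i_m = \sqrt{2}\, I = (1.41)(1.04\ \text{A}) = 1.47\ \text{A}$

This current oscillates between +1.47 A and -1.47 A, and is ahead of
the voltage by π/2.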

If the frequency is doubled, the capacitive reactance is halved and 
consequently, the current is doubled. 
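
The numbers in Example 7.4, including the effect of doubling the frequency, can be checked with a few lines of code. This is a sketch under the example's own values; the function name `capacitive_reactance` and the constant names are illustrative assumptions.

```python
import math

def capacitive_reactance(frequency_hz, capacitance_f):
    """X_C = 1 / (omega * C) with omega = 2 * pi * nu (Eq. 7.17), in ohms."""
    return 1.0 / (2.0 * math.pi * frequency_hz * capacitance_f)

V_RMS = 220.0        # source voltage (rms), from the example
CAPACITANCE = 15.0e-6  # 15.0 microfarad

for nu in (50.0, 100.0):  # original and doubled frequency
    x_c = capacitive_reactance(nu, CAPACITANCE)
    i_rms = V_RMS / x_c
    i_peak = math.sqrt(2.0) * i_rms
    print(f"nu = {nu:5.1f} Hz: X_C = {x_c:6.1f} ohm, "
          f"I_rms = {i_rms:.2f} A, I_peak = {i_peak:.2f} A")
```

The second line of output shows the reactance halved and both currents doubled relative to the first, as stated in the solution.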






Example 7.5 A light bulb and an open coil inductor are connected to 
an ac source through a key as shown in Fig. 7.11. 


Figure 7.11 

The switch is closed and after some time, an iron rod is inserted into 
the interior of the inductor. The glow of the light bulb (a) increases; (b) 
decreases; (c) is unchanged, as the iron rod is inserted. Give your 
answer with reasons. 

Solution As the iron rod is inserted, the magnetic field inside the coil 
magnetizes the iron increasing the magnetic field inside it. Hence, the 
inductance of the coil increases. Consequently, the inductive 
reactance of the coil increases. As a result, a larger fraction of the 
applied ac voltage appears across the inductor, leaving less voltage 
across the bulb. Therefore, the glow of the light bulb decreases. 


7.6 AC Voltage Applied to a Series LCR Circuit 

Figure 7.12 shows a series LCR circuit connected to an ac source ε. As
usual, we take the voltage of the source to be v = v_m sin ωt. 

If q is the charge on the capacitor and i the current, at time t, we have, 
from Kirchhoff’s loop rule: 









$L \frac{di}{dt} + iR + \frac{q}{C} = v$ (7.20)

We want to determine the instantaneous current i and its phase
relationship to the applied alternating voltage v. We shall solve this
problem by two methods. First, we use the technique of phasors and in
the second method, we solve Eq. (7.20) analytically to obtain the
time-dependence of i.



FIGURE 7.12 A series LCR circuit connected to an ac source. 

7.6.1 Phasor-diagram solution 

From the circuit shown in Fig. 7.12, we see that the resistor, inductor and 
capacitor are in series. Therefore, the ac current in each element is the 
same at any time, having the same amplitude and phase. Let it be 

$i = i_m \sin(\omega t + \phi)$ (7.21)

where φ is the phase difference between the voltage across the source
and the current in the circuit. On the basis of what we have learnt in the
previous sections, we shall construct a phasor diagram for the present
case.

Let I be the phasor representing the current in the circuit as given by Eq.
(7.21). Further, let V_L, V_R, V_C, and V represent the voltage across the
inductor, resistor, capacitor and the source, respectively. From the previous
sections, we know that V_R is parallel to I, V_C is π/2 behind I and V_L is π/2
ahead of I. V_L, V_R, V_C and I are shown in Fig. 7.13(a) with appropriate
phase-relations.

The length of these phasors or the amplitude of V_R, V_C and V_L are:

$v_{Rm} = i_m R,\quad v_{Cm} = i_m X_C,\quad v_{Lm} = i_m X_L$ (7.22)

The voltage Equation (7.20) for the circuit can be written as

$v_L + v_R + v_C = v$ (7.23)

The phasor relation whose vertical component gives the above equation
is

$\mathbf{V}_L + \mathbf{V}_R + \mathbf{V}_C = \mathbf{V}$ (7.24)

This relation is represented in Fig. 7.13(b). Since V_C and V_L are always
along the same line and in opposite directions, they can be combined into
a single phasor (V_C + V_L) which has a magnitude |v_Cm - v_Lm|. Since V is
represented as the hypotenuse of a right-triangle whose sides are V_R and
(V_C + V_L), the Pythagorean theorem gives:

$v_m^2 = v_{Rm}^2 + (v_{Cm} - v_{Lm})^2$

Substituting the values of v_Rm, v_Cm, and v_Lm from Eq. (7.22) into the
above equation, we have

$v_m^2 = (i_m R)^2 + (i_m X_C - i_m X_L)^2$

$= i_m^2 \left[ R^2 + (X_C - X_L)^2 \right]$

or,

$i_m = \frac{v_m}{\sqrt{R^2 + (X_C - X_L)^2}}$ [7.25(a)]

By analogy to the resistance in a circuit, we introduce the impedance Z in
an ac circuit:

$i_m = \frac{v_m}{Z}$ [7.25(b)]

where $Z = \sqrt{R^2 + (X_C - X_L)^2}$ (7.26)

FIGURE 7.13 (a) Relation between the phasors V_L, V_R, V_C, and I. (b) Relation between the
phasors V_L, V_R, and (V_L + V_C) for the circuit in Fig. 7.12.

Since phasor I is always parallel to phasor V_R, the phase angle φ is the
angle between V_R and V and can be determined from Fig. 7.14:

$\tan\phi = \frac{v_{Cm} - v_{Lm}}{v_{Rm}}$

Using Eq. (7.22), we have

$\tan\phi = \frac{X_C - X_L}{R}$ (7.27)

FIGURE 7.14 Impedance diagram.

Equations (7.26) and (7.27) are graphically shown in Fig. 7.14. This is
called the impedance diagram, which is a right-triangle with Z as its
hypotenuse.

Equation 7.25(a) gives the amplitude of the current and Eq. (7.27) gives
the phase angle. With these, Eq. (7.21) is completely specified.

If X_C > X_L, φ is positive and the circuit is predominantly capacitive.
Consequently, the current in the circuit leads the source voltage. If
X_C < X_L, φ is negative and the circuit is predominantly inductive.
Consequently, the current in the circuit lags the source voltage.
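
The impedance triangle and the sign convention for the phase angle can be sketched in code. The function name `impedance_and_phase` and the sample reactances below are illustrative assumptions, not values from the text at this point; the formulas are Eqs. (7.26) and (7.27).

```python
import math

def impedance_and_phase(resistance, x_l, x_c):
    """Return (Z, phi) for a series LCR circuit.

    Z   = sqrt(R^2 + (X_C - X_L)^2)      (Eq. 7.26)
    phi = arctan((X_C - X_L) / R)        (Eq. 7.27)
    A positive phi means the current leads the source voltage.
    """
    z = math.hypot(resistance, x_c - x_l)
    phi = math.atan2(x_c - x_l, resistance)
    return z, phi

def describe(phi):
    """Classify the circuit from the sign of phi."""
    if phi > 0:
        return "predominantly capacitive, current leads"
    if phi < 0:
        return "predominantly inductive, current lags"
    return "purely resistive behaviour, current in phase"

# Hypothetical sample values in ohms: (R, X_L, X_C).
for r, x_l, x_c in ((3.0, 8.0, 4.0), (3.0, 4.0, 8.0), (3.0, 5.0, 5.0)):
    z, phi = impedance_and_phase(r, x_l, x_c)
    print(f"R={r}, X_L={x_l}, X_C={x_c}: Z={z:.2f} ohm, "
          f"phi={math.degrees(phi):.1f} deg ({describe(phi)})")
```

Using `atan2` with R as the second argument keeps φ between -π/2 and π/2 for positive R, which matches the geometry of the impedance diagram.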

Figure 7.15 shows the phasor diagram and variation of v and i with ωt for
the case X_C > X_L.

FIGURE 7.15 (a) Phasor diagram of V and I. (b) Graphs of v and i versus ωt for a series
LCR circuit where X_C > X_L.

Thus, we have obtained the amplitude and phase of current for an LCR
series circuit using the technique of phasors. But this method of
analysing ac circuits suffers from certain disadvantages. First, the phasor
diagram says nothing about the initial condition. One can take any
arbitrary value of t (say, t_1, as done throughout this chapter) and draw
different phasors which show the relative angle between different
phasors.

The solution so obtained is called the steady-state solution. This is not a
general solution. Additionally, we do have a transient solution which
exists even for v = 0. The general solution is the sum of the transient
solution and the steady-state solution. After a sufficiently long time, the
effects of the transient solution die out and the behaviour of the circuit is
described by the steady-state solution.

7.6.2 Analytical solution 

The voltage equation for the circuit is

$L \frac{di}{dt} + Ri + \frac{q}{C} = v = v_m \sin \omega t$

We know that i = dq/dt. Therefore, di/dt = d²q/dt². Thus, in terms of q, the
voltage equation becomes

$L \frac{d^2 q}{dt^2} + R \frac{dq}{dt} + \frac{q}{C} = v_m \sin \omega t$ (7.28)


This is like the equation for a forced, damped oscillator [see Eq.
{14.37(b)} in Class XI Physics Textbook]. Let us assume a solution

$q = q_m \sin(\omega t + \theta)$ [7.29(a)]

so that $\frac{dq}{dt} = q_m \omega \cos(\omega t + \theta)$ [7.29(b)]

and $\frac{d^2 q}{dt^2} = -q_m \omega^2 \sin(\omega t + \theta)$ [7.29(c)]
dr" 

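Substituting these values in Eq. (7.28), we get

$q_m \omega \left[ R \cos(\omega t + \theta) + (X_C - X_L) \sin(\omega t + \theta) \right] = v_m \sin \omega t$ (7.30)

where we have used the relation X_C = 1/ωC, X_L = ωL. Multiplying and
dividing Eq. (7.30) by $Z = \sqrt{R^2 + (X_C - X_L)^2}$, we have

$q_m \omega Z \left[ \frac{R}{Z} \cos(\omega t + \theta) + \frac{(X_C - X_L)}{Z} \sin(\omega t + \theta) \right] = v_m \sin \omega t$ (7.31)

Now, let $\frac{R}{Z} = \cos\phi$

and $\frac{(X_C - X_L)}{Z} = \sin\phi$

so that $\phi = \tan^{-1} \frac{X_C - X_L}{R}$ (7.32)

Substituting this in Eq. (7.31) and simplifying, we get:

$q_m \omega Z \cos(\omega t + \theta - \phi) = v_m \sin \omega t$ (7.33)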
Comparing the two sides of this equation, we see that

$v_m = q_m \omega Z = i_m Z$

where

$i_m = q_m \omega$ [7.33(a)]

and $\theta - \phi = -\frac{\pi}{2}$ or $\theta = -\frac{\pi}{2} + \phi$ [7.33(b)]

Therefore, the current in the circuit is

$i = \frac{dq}{dt} = q_m \omega \cos(\omega t + \theta)$

$= i_m \cos(\omega t + \theta)$

or $i = i_m \sin(\omega t + \phi)$ (7.34)



Thus, the analytical solution for the amplitude and phase of the current in 
the circuit agrees with that obtained by the technique of phasors. 
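
One way to gain confidence in the analytical result is to substitute it back into Eq. (7.28) numerically. The sketch below does this for one set of component values; the function name `steady_state_residual` and the values chosen (a 100 V peak, 50 Hz source with R = 3 Ω, L = 25.48 mH, C = 796 μF) are assumptions made only for illustration.

```python
import math

def steady_state_residual(t, v_m, omega, R, L, C):
    """Left side minus right side of Eq. (7.28) for the assumed solution
    q = q_m sin(omega*t + theta), with theta = phi - pi/2 (Eq. 7.33(b))
    and q_m = v_m / (omega * Z) (from v_m = q_m * omega * Z)."""
    x_l = omega * L
    x_c = 1.0 / (omega * C)
    z = math.hypot(R, x_c - x_l)
    phi = math.atan2(x_c - x_l, R)
    theta = phi - math.pi / 2.0
    q_m = v_m / (omega * z)

    arg = omega * t + theta
    q = q_m * math.sin(arg)
    dq = q_m * omega * math.cos(arg)
    d2q = -q_m * omega**2 * math.sin(arg)
    return L * d2q + R * dq + q / C - v_m * math.sin(omega * t)

# Illustrative values, assumed for this check only.
V_M, OMEGA = 100.0, 2.0 * math.pi * 50.0
R_OHM, L_H, C_F = 3.0, 25.48e-3, 796.0e-6

residuals = [steady_state_residual(0.001 * k, V_M, OMEGA, R_OHM, L_H, C_F)
             for k in range(40)]
print(f"largest residual: {max(abs(r) for r in residuals):.2e}")
```

The residual stays at the level of floating-point rounding for every sampled instant, so the assumed form of q(t) does satisfy the voltage equation for these values.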


7.6.3 Resonance 



An interesting characteristic of the series RLC circuit is the phenomenon 
of resonance. The phenomenon of resonance is common among 
systems that have a tendency to oscillate at a particular frequency. This 
frequency is called the system’s natural frequency. If such a system is 
driven by an energy source at a frequency that is near the natural 
frequency, the amplitude of oscillation is found to be large. A familiar 
example of this phenomenon is a child on a swing. The swing has a 
natural frequency for swinging back and forth like a pendulum. If the child 
pulls on the rope at regular intervals and the frequency of the pulls is 
almost the same as the frequency of swinging, the amplitude of the 
swinging will be large (Chapter 14, Class XI). 

For an RLC circuit driven with voltage of amplitude v_m and frequency ω,
we found that the current amplitude is given by

$i_m = \frac{v_m}{Z} = \frac{v_m}{\sqrt{R^2 + (X_C - X_L)^2}}$

with X_C = 1/ωC and X_L = ωL. So if ω is varied, then at a particular
frequency ω_0, X_C = X_L, and the impedance is minimum
$\left(Z = \sqrt{R^2 + 0^2} = R\right)$. This frequency is called the resonant frequency:

$X_C = X_L$ or $\frac{1}{\omega_0 C} = \omega_0 L$

or $\omega_0 = \frac{1}{\sqrt{LC}}$ (7.35)


FIGURE 7.16 Variation of i_m with ω for two cases: (i) R = 100 Ω, (ii) R = 200 Ω, L = 1.00 mH.

At resonant frequency, the current amplitude is maximum; i_m = v_m/R.

Figure 7.16 shows the variation of i_m with ω in a RLC series circuit with
L = 1.00 mH, C = 1.00 nF for two values of R: (i) R = 100 Ω
and (ii) R = 200 Ω. For the source applied, v_m = 100 V. ω_0 for this case is

$\omega_0 = \frac{1}{\sqrt{LC}} = 1.00 \times 10^6$ rad/s.


We see that the current amplitude is maximum at the resonant frequency.
Since i_m = v_m/R at resonance, the current amplitude for case (i) is twice
that for case (ii).
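
The resonance curve described for Fig. 7.16 can be tabulated directly from the expression for i_m. This is a minimal sketch using the component values quoted above; the function name `current_amplitude` and the choice of sample frequencies are assumptions for illustration.

```python
import math

def current_amplitude(v_m, omega, R, L, C):
    """i_m = v_m / sqrt(R^2 + (omega*L - 1/(omega*C))^2)."""
    return v_m / math.hypot(R, omega * L - 1.0 / (omega * C))

L_H, C_F, V_M = 1.00e-3, 1.00e-9, 100.0   # values quoted for Fig. 7.16
omega_0 = 1.0 / math.sqrt(L_H * C_F)       # resonant frequency, Eq. (7.35)

print(f"omega_0 = {omega_0:.3e} rad/s")
for R in (100.0, 200.0):
    # Sample the curve below, at, and above resonance.
    row = [current_amplitude(V_M, f * omega_0, R, L_H, C_F)
           for f in (0.8, 0.9, 1.0, 1.1, 1.2)]
    print(f"R = {R:5.1f} ohm: " + ", ".join(f"{i:.3f}" for i in row))
```

In each row the middle entry, taken at ω_0, is the largest, and it equals v_m/R, so halving R doubles the peak.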

Resonant circuits have a variety of applications, for example, in the
tuning mechanism of a radio or a TV set. The antenna of a radio accepts
signals from many broadcasting stations. The signals picked up in the
antenna act as a source in the tuning circuit of the radio, so the circuit
can be driven at many frequencies. But to hear one particular radio
station, we tune the radio. In tuning, we vary the capacitance of a
capacitor in the tuning circuit such that the resonant frequency of the
circuit becomes nearly equal to the frequency of the radio signal
received. When this happens, the amplitude of the current with the
frequency of the signal of the particular radio station in the circuit is
maximum.

It is important to note that resonance phenomenon is exhibited by a
circuit only if both L and C are present in the circuit. Only then do the
voltages across L and C cancel each other (both being out of phase) and
the current amplitude is v_m/R, the total source voltage appearing across
R. This means that we cannot have resonance in a RL or
RC circuit.

Sharpness of resonance 

The amplitude of the current in the series LCR circuit is given by

$i_m = \frac{v_m}{\sqrt{R^2 + \left(\omega L - \frac{1}{\omega C}\right)^2}}$

and is maximum when $\omega = \omega_0 = 1/\sqrt{LC}$. The maximum value is
$i_m^{max} = v_m / R$.

For values of ω other than ω_0, the amplitude of the current is less than
the maximum value. Suppose we choose a value of ω for which the
current amplitude is 1/√2 times its maximum value. At this value, the
power dissipated by the circuit becomes half. From the curve in Fig.
(7.16), we see that there are two such values of ω, say, ω_1 and ω_2, one
greater and the other smaller than ω_0 and symmetrical about ω_0. We
may write

ω_1 = ω_0 + Δω

ω_2 = ω_0 - Δω

The difference ω_1 - ω_2 = 2Δω is often called the bandwidth of the circuit.
The quantity (ω_0 / 2Δω) is regarded as a measure of the sharpness of
resonance. The smaller the Δω, the sharper or narrower is the
resonance.

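To get an expression for Δω, we note that the current amplitude i_m is
(1/√2) i_m^max for ω_1 = ω_0 + Δω. Therefore,

$\frac{v_m}{\sqrt{R^2 + \left(\omega_1 L - \frac{1}{\omega_1 C}\right)^2}} = \frac{v_m}{R\sqrt{2}}$

or $R^2 + \left(\omega_1 L - \frac{1}{\omega_1 C}\right)^2 = 2R^2$

$\omega_1 L - \frac{1}{\omega_1 C} = R$

which may be written as,

$\omega_0 L \left(1 + \frac{\Delta\omega}{\omega_0}\right) - \frac{1}{\omega_0 C \left(1 + \frac{\Delta\omega}{\omega_0}\right)} = R$

Using $\omega_0^2 = \frac{1}{LC}$ in the second term on the left hand side, we get

$\omega_0 L \left(1 + \frac{\Delta\omega}{\omega_0}\right) - \frac{\omega_0 L}{\left(1 + \frac{\Delta\omega}{\omega_0}\right)} = R$

We can approximate $\left(1 + \frac{\Delta\omega}{\omega_0}\right)^{-1}$ as $\left(1 - \frac{\Delta\omega}{\omega_0}\right)$ since $\frac{\Delta\omega}{\omega_0} \ll 1$.
Therefore,

$\omega_0 L \left(1 + \frac{\Delta\omega}{\omega_0}\right) - \omega_0 L \left(1 - \frac{\Delta\omega}{\omega_0}\right) = R$

or $\omega_0 L \frac{2\Delta\omega}{\omega_0} = R$

$\Delta\omega = \frac{R}{2L}$ [7.36(a)]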


The sharpness of resonance is given by,

$\frac{\omega_0}{2\Delta\omega} = \frac{\omega_0 L}{R}$ [7.36(b)]

The ratio $\frac{\omega_0 L}{R}$ is also called the quality factor, Q of the circuit.

$Q = \frac{\omega_0 L}{R}$ [7.36(c)]

From Eqs. [7.36(b)] and [7.36(c)], we see that $2\Delta\omega = \frac{\omega_0}{Q}$. So, larger
the value of Q, the smaller is the value of 2Δω or the bandwidth and
sharper is the resonance. Using $\omega_0^2 = 1/LC$, Eq. [7.36(c)] can be
equivalently expressed as Q = 1/ω_0CR.
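
The bandwidth and quality factor can be checked against the exact half-power frequencies, obtained by solving |ωL - 1/ωC| = R as a quadratic in ω. The sketch below assumes the illustrative values R = 100 Ω, L = 1.00 mH, C = 1.00 nF; the function names are hypothetical.

```python
import math

def half_power_frequencies(R, L, C):
    """Positive roots of omega*L - 1/(omega*C) = -R and = +R,
    returned as (lower, upper) in rad/s."""
    root = math.sqrt(R**2 + 4.0 * L / C)
    lower = (-R + root) / (2.0 * L)
    upper = (R + root) / (2.0 * L)
    return lower, upper

def quality_factor(R, L, C):
    """Q = omega_0 * L / R (Eq. 7.36(c)) with omega_0 = 1/sqrt(L*C)."""
    omega_0 = 1.0 / math.sqrt(L * C)
    return omega_0 * L / R

R, L, C = 100.0, 1.00e-3, 1.00e-9          # assumed sample values
lower, upper = half_power_frequencies(R, L, C)
bandwidth = upper - lower                   # exact value of 2 * delta_omega
q = quality_factor(R, L, C)
omega_0 = 1.0 / math.sqrt(L * C)

print(f"bandwidth = {bandwidth:.4e} rad/s, R/L = {R / L:.4e} rad/s")
print(f"Q = {q:.2f}, omega_0 / bandwidth = {omega_0 / bandwidth:.2f}")
```

The exact width comes out equal to R/L, that is 2Δω with Δω = R/2L, even though the two half-power frequencies are only approximately symmetric about ω_0; the approximation in the derivation affects their positions, not their separation.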


We see from Fig. 7.16, that if the resonance is less sharp, not only is the
maximum current less, the circuit is close to resonance for a larger range
Δω of frequencies and the tuning of the circuit will not be good. So, less
sharp the resonance, less is the selectivity of the circuit or vice versa.
From Eq. (7.36), we see that if quality factor is large, i.e., R is low or L is
large, the circuit is more selective.


Example 7.6 A resistor of 200 Ω and a capacitor of 15.0 μF are
connected in series to a 220 V, 50 Hz ac source. (a) Calculate the
current in the circuit; (b) Calculate the voltage (rms) across the
resistor and the capacitor. Is the algebraic sum of these voltages
more than the source voltage? If yes, resolve the paradox.

Solution

Given

R = 200 Ω, C = 15.0 μF = 15.0 × 10⁻⁶ F

V = 220 V, ν = 50 Hz

(a) In order to calculate the current, we need the impedance of the 
circuit. It is 





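$Z = \sqrt{R^2 + X_C^2} = \sqrt{R^2 + (2\pi\nu C)^{-2}}$

$= \sqrt{(200\ \Omega)^2 + (2 \times 3.14 \times 50 \times 15.0 \times 10^{-6}\ \text{F})^{-2}}$

$= \sqrt{(200\ \Omega)^2 + (212.3\ \Omega)^2}$

$= 291.67\ \Omega$

Therefore, the current in the circuit is

$I = \frac{V}{Z} = \frac{220\ \text{V}}{291.5\ \Omega} = 0.755\ \text{A}$

(b) Since the current is the same throughout the circuit, we have

$V_R = I R = (0.755\ \text{A})(200\ \Omega) = 151\ \text{V}$

$V_C = I X_C = (0.755\ \text{A})(212.3\ \Omega) = 160.3\ \text{V}$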

The algebraic sum of the two voltages, V_R and V_C, is 311.3 V which
is more than the source voltage of 220 V. How to resolve this
paradox? As you have learnt in the text, the two voltages are not in
the same phase. Therefore, they cannot be added like ordinary
numbers. The two voltages are out of phase by ninety degrees.
Therefore, the total of these voltages must be obtained using the
Pythagorean theorem:

$V_{R+C} = \sqrt{V_R^2 + V_C^2}$

= 220 V

Thus, if the phase difference between two voltages is properly taken 
into account, the total voltage across the resistor and the capacitor is 
equal to the voltage of the source. 
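
The resolution of the paradox in Example 7.6 can be seen numerically. The sketch below recomputes the example with full-precision π, so the printed figures differ slightly in the last digit from the rounded values in the worked solution; the variable names are chosen here for illustration.

```python
import math

R, C = 200.0, 15.0e-6       # ohm, farad (values from the example)
V, NU = 220.0, 50.0         # rms volts, hertz

x_c = 1.0 / (2.0 * math.pi * NU * C)   # capacitive reactance
z = math.hypot(R, x_c)                 # impedance of the series RC circuit
i = V / z                              # rms current

v_r = i * R                            # rms voltage across the resistor
v_c = i * x_c                          # rms voltage across the capacitor

algebraic_sum = v_r + v_c              # ignores the 90 degree phase difference
phasor_sum = math.hypot(v_r, v_c)      # Pythagorean (phasor) addition

print(f"Z = {z:.1f} ohm, I = {i:.3f} A")
print(f"V_R = {v_r:.1f} V, V_C = {v_c:.1f} V")
print(f"algebraic sum = {algebraic_sum:.1f} V, phasor sum = {phasor_sum:.1f} V")
```

The algebraic sum exceeds the source voltage, while the phasor sum reproduces it, because the two voltages are a quarter cycle apart.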










7.7 Power in AC Circuit: The Power Factor 

We have seen that a voltage v = v_m sin ωt applied to a series RLC circuit
drives a current in the circuit given by i = i_m sin(ωt + φ) where

$i_m = \frac{v_m}{Z}$ and $\phi = \tan^{-1}\left(\frac{X_C - X_L}{R}\right)$

Therefore, the instantaneous power p supplied by the source is

$p = v i = (v_m \sin \omega t) \times [i_m \sin(\omega t + \phi)]$

$= \frac{v_m i_m}{2} \left[ \cos\phi - \cos(2\omega t + \phi) \right]$ (7.37)


The average power over a cycle is given by the average of the two terms
in R.H.S. of Eq. (7.37). It is only the second term which is time-dependent.
Its average is zero (the positive half of the cosine cancels the
negative half). Therefore,

$P = \frac{v_m i_m}{2} \cos\phi = \frac{v_m}{\sqrt{2}} \frac{i_m}{\sqrt{2}} \cos\phi$

$= V I \cos\phi$ [7.38(a)]

This can also be written as,

$P = I^2 Z \cos\phi$ [7.38(b)]


So, the average power dissipated depends not only on the voltage and
current but also on the cosine of the phase angle φ between them. The
quantity cos φ is called the power factor. Let us discuss the following
cases:

Case (i) Resistive circuit: If the circuit contains only pure R, it is called
resistive. In that case φ = 0, cos φ = 1. There is maximum power
dissipation.

Case (ii) Purely inductive or capacitive circuit: If the circuit contains only
an inductor or capacitor, we know that the phase difference between
voltage and current is π/2. Therefore, cos φ = 0, and no power is
dissipated even though a current is flowing in the circuit. This current is
sometimes referred to as wattless current.

Case (iii) LCR series circuit: In an LCR series circuit, power dissipated is
given by Eq. (7.38) where φ = tan⁻¹ (X_C - X_L)/R. So, φ may be non-zero
in a RL or RC or RCL circuit. Even in such cases, power is dissipated
only in the resistor.

Case (iv) Power dissipated at resonance in LCR circuit: At resonance
X_C - X_L = 0, and φ = 0. Therefore, cos φ = 1 and P = I²Z = I²R. That is,
maximum power is dissipated in a circuit (through R) at resonance.


Example 7.7 (a) For circuits used for transporting electric power, a 
low power factor implies large power loss in transmission. Explain. 

(b) Power factor can often be improved by the use of a capacitor of 
appropriate capacitance in the circuit. Explain. 

Solution (a) We know that P = I V cos φ where cos φ is the power
factor. To supply a given power at a given voltage, if cos φ is small, we
have to increase current accordingly. But this will lead to large power
loss (I²R) in transmission.

(b) Suppose in a circuit, current I lags the voltage by an angle φ. Then
power factor cos φ = R/Z.

We can improve the power factor (tending to 1) by making Z tend to
R. Let us understand, with the help of a phasor diagram (Fig. 7.17)
how this can be achieved. Let us resolve I into two components: I_p
along the applied voltage V and I_q perpendicular to the applied
voltage. I_q, as you have learnt in Section 7.7, is called the wattless
component since corresponding to this component of current, there is
no power loss. I_p is known as the power component because it is in
phase with the voltage and corresponds to power loss in the circuit.

Figure 7.17

It's clear from this analysis that if we want to improve power factor,
we must completely neutralize the lagging wattless current I_q by an
equal leading wattless current I'_q. This can be done by connecting a
capacitor of appropriate value in parallel so that I_q and I'_q cancel
each other and P is effectively I_p V.

Example 7.8 A sinusoidal voltage of peak value 283 V and frequency
50 Hz is applied to a series LCR circuit in which
R = 3 Ω, L = 25.48 mH, and C = 796 μF. Find (a) the impedance of
the circuit; (b) the phase difference between the voltage across the
source and the current; (c) the power dissipated in the circuit; and (d)
the power factor.

Solution 

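(a) To find the impedance of the circuit, we first calculate X_L and X_C.

$X_L = 2\pi\nu L$

$= 2 \times 3.14 \times 50 \times 25.48 \times 10^{-3}\ \Omega = 8\ \Omega$

$X_C = \frac{1}{2\pi\nu C}$

$= \frac{1}{2 \times 3.14 \times 50 \times 796 \times 10^{-6}} = 4\ \Omega$

Therefore,

$Z = \sqrt{R^2 + (X_L - X_C)^2} = \sqrt{3^2 + (8 - 4)^2}$

$= 5\ \Omega$

(b) Phase difference, $\phi = \tan^{-1} \frac{X_C - X_L}{R}$

$= \tan^{-1}\left(\frac{4 - 8}{3}\right) = -53.1^\circ$

Since φ is negative, the current in the circuit lags the voltage across
the source.

(c) The power dissipated in the circuit is

$P = I^2 R$

Now, $I = \frac{i_m}{\sqrt{2}} = \frac{1}{\sqrt{2}} \left(\frac{283}{5}\right) = 40\ \text{A}$

Therefore, $P = (40\ \text{A})^2 \times 3\ \Omega = 4800\ \text{W}$

(d) Power factor = cos φ = cos(-53.1°) = 0.6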
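
The four parts of Example 7.8 can be computed together, which also shows that V I cos φ and I²R give the same power. This is a sketch with the example's values and full-precision π, so results differ slightly from the rounded figures in the solution; the function name `lcr_summary` is hypothetical.

```python
import math

def lcr_summary(v_peak, nu, R, L, C):
    """Return (Z, phi, I_rms, P, power_factor) for a series LCR circuit."""
    omega = 2.0 * math.pi * nu
    x_l = omega * L
    x_c = 1.0 / (omega * C)
    z = math.hypot(R, x_c - x_l)            # Eq. (7.26)
    phi = math.atan2(x_c - x_l, R)          # Eq. (7.27)
    v_rms = v_peak / math.sqrt(2.0)
    i_rms = v_rms / z
    power = v_rms * i_rms * math.cos(phi)   # Eq. 7.38(a)
    return z, phi, i_rms, power, math.cos(phi)

z, phi, i_rms, power, pf = lcr_summary(283.0, 50.0, 3.0, 25.48e-3, 796.0e-6)

print(f"Z = {z:.2f} ohm, phi = {math.degrees(phi):.1f} deg")
print(f"I = {i_rms:.1f} A, P = {power:.0f} W, power factor = {pf:.2f}")
print(f"I^2 R = {i_rms**2 * 3.0:.0f} W")
```

The last two printed powers agree because cos φ = R/Z, so V I cos φ = I Z · I · R/Z = I²R: all the dissipation takes place in the resistor.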


Example 7.9 Suppose the frequency of the source in the previous 
example can be varied, (a) What is the frequency of the source at 
which resonance occurs? (b) Calculate the impedance, the current, 
and the power dissipated at the resonant condition. 

Solution 

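(a) The frequency at which the resonance occurs is

$\omega_0 = \frac{1}{\sqrt{LC}} = \frac{1}{\sqrt{25.48 \times 10^{-3} \times 796 \times 10^{-6}}}$

$= 222.1$ rad/s

$\nu_r = \frac{\omega_0}{2\pi} = \frac{222.1}{2 \times 3.14}$ Hz $= 35.4$ Hz

(b) The impedance Z at resonant condition is equal to the resistance:

$Z = R = 3\ \Omega$

The rms current at resonance is

$I = \frac{V}{Z} = \frac{V}{R} = \left(\frac{283}{\sqrt{2}}\right) \frac{1}{3} = 66.7\ \text{A}$

The power dissipated at resonance is

$P = I^2 \times R = (66.7)^2 \times 3 = 13.35$ kW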

You can see that in the present case, power dissipated 
at resonance is more than the power dissipated in Example 7.8. 










Example 7.10 At an airport, a person is made to walk through the 
doorway of a metal detector, for security reasons. If she/he is carrying 
anything made of metal, the metal detector emits a sound. On what 
principle does this detector work? 

Solution The metal detector works on the principle of resonance in 
ac circuits. When you walk through a metal detector, you are, 
in fact, walking through a coil of many turns. The coil is connected to 
a capacitor tuned so that the circuit is in resonance. When 
you walk through with metal in your pocket, the impedance of the 
circuit changes - resulting in significant change in current in the 
circuit. This change in current is detected and the electronic circuitry 
causes a sound to be emitted as an alarm. 


7.8 LC Oscillations 

We know that a capacitor and an inductor can store electrical and 
magnetic energy, respectively. When a capacitor (initially charged) is 
connected to an inductor, the charge on the capacitor and the current in 
the circuit exhibit the phenomenon of electrical oscillations similar to 
oscillations in mechanical systems (Chapter 14, Class XI). 

Let a capacitor be charged q m (at t = 0) and connected to an inductor as 
shown in Fig. 7.18. 



FIGURE 7.18 At the instant shown, the current is increasing; so the polarity of induced emf in the
inductor is as shown.


The moment the circuit is completed, the charge on the capacitor starts
decreasing, giving rise to current in the circuit. Let q and i be the charge
and current in the circuit at time t. Since di/dt is positive, the induced emf
in L will have polarity as shown, i.e., v_b < v_a. According to Kirchhoff's loop
rule,

$\frac{q}{C} - L \frac{di}{dt} = 0$ (7.39)

i = -(dq/dt) in the present case (as q decreases, i increases). Therefore,
Eq. (7.39) becomes:

$\frac{d^2 q}{dt^2} + \frac{1}{LC} q = 0$ (7.40)

This equation has the form $\frac{d^2 x}{dt^2} + \omega_0^2 x = 0$ for a simple harmonic
oscillator. The charge on the capacitor, therefore, oscillates with a natural
frequency

$\omega_0 = \frac{1}{\sqrt{LC}}$ (7.41)

and varies sinusoidally with time as

$q = q_m \cos(\omega_0 t + \phi)$ (7.42)

where q_m is the maximum value of q and φ is a phase constant. Since
q = q_m at t = 0, we have cos φ = 1 or φ = 0. Therefore, in the present
case,

$q = q_m \cos(\omega_0 t)$ (7.43)

The current $i \left(= -\frac{dq}{dt}\right)$ is given by

$i = i_m \sin(\omega_0 t)$ (7.44)

where $i_m = \omega_0 q_m$

Let us now try to visualise how this oscillation takes place in the circuit.
Figure 7.19(a) shows a capacitor with initial charge q_m connected to an
ideal inductor. The electrical energy stored in the charged capacitor is
$U_E = \frac{1}{2} \frac{q_m^2}{C}$. Since there is no current in the circuit, energy in the
inductor is zero. Thus, the total energy of LC circuit is,

$U = U_E = \frac{1}{2} \frac{q_m^2}{C}$

Figure 7.19 The oscillations in an LC circuit are analogous to the oscillation of a block at the end
of a spring. The figure depicts one-half of a cycle. Panels (a) to (e) of the mechanical analogue are
labelled U = (1/2)kA², K = 0; U ≠ 0, K ≠ 0; U = 0, K = (1/2)mv_m²; U ≠ 0, K ≠ 0; U = (1/2)kA², K = 0.


At t = 0, the switch is closed and the capacitor starts to discharge [Fig.
7.19(b)]. As the current increases, it sets up a magnetic field in the
inductor and thereby, some energy gets stored in the inductor in the form
of magnetic energy: $U_B = (1/2)\,Li^2$. As the current reaches its maximum
value $i_m$ (at t = T/4) as in Fig. 7.19(c), all the energy is stored in the
magnetic field: $U_B = (1/2)\,Li_m^2$. You can easily check that the maximum
electrical energy equals the maximum magnetic energy. The capacitor
now has no charge and hence no energy. The current now starts
charging the capacitor, as in Fig. 7.19(d). This process continues till the
capacitor is fully charged (at t = T/2) [Fig. 7.19(e)]. But it is charged with a
polarity opposite to its initial state in Fig. 7.19(a). The whole process just
described will now repeat itself till the system reverts to its original state.
Thus, the energy in the system oscillates between the capacitor and the
inductor.


The LC oscillation is similar to the mechanical oscillation of a block
attached to a spring. The lower part of each figure in Fig. 7.19 depicts the
corresponding stage of a mechanical system (a block attached to a
spring). As noted earlier, for a block of mass m oscillating with
frequency $\omega_0$, the equation is

$$\frac{d^2 x}{dt^2} + \omega_0^2\,x = 0$$

Here, $\omega_0 = \sqrt{k/m}$, and k is the spring constant. So, x corresponds
to q. In case of a mechanical system F = ma = m (dv/dt) = m (d²x/dt²). For
an electrical system, $\varepsilon = -L\,(di/dt) = -L\,(d^2q/dt^2)$. Comparing
these two equations, we see that L is analogous to mass m: L is a measure of
resistance to change in current. In case of LC circuit,
$\omega_0 = 1/\sqrt{LC}$ and for mass on a spring, $\omega_0 = \sqrt{k/m}$.
So, 1/C is analogous to k. The constant k (= F/x) tells us the (external)
force required to produce a unit displacement whereas 1/C (= V/q) tells us
the potential difference required to store a unit charge. Table 7.1 gives the
analogy between mechanical and electrical quantities.


Table 7.1 Analogies between mechanical and electrical quantities

Mechanical system                          Electrical system

Mass m                                     Inductance L
Force constant k                           Reciprocal capacitance 1/C
Displacement x                             Charge q
Velocity v = dx/dt                         Current i = dq/dt
Mechanical energy                          Electromagnetic energy
$E = \frac{1}{2}kx^2 + \frac{1}{2}mv^2$    $U = \frac{1}{2}\frac{q^2}{C} + \frac{1}{2}Li^2$
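The correspondence in Table 7.1 can be made concrete with a small Python
sketch. It assumes hypothetical values for m and k, then chooses L and C so
that L = m and 1/C = k, and compares the natural frequencies and the energy
expressions of the two systems. The function names are illustrative only.

```python
import math

def spring_omega0(m, k):
    """Natural angular frequency of a block of mass m on a spring of constant k."""
    return math.sqrt(k / m)

def lc_omega0(L, C):
    """Natural angular frequency of an ideal LC circuit, Eq. (7.41)."""
    return 1.0 / math.sqrt(L * C)

def spring_energy(m, k, x, v):
    """Mechanical energy E = (1/2) k x^2 + (1/2) m v^2."""
    return 0.5 * k * x**2 + 0.5 * m * v**2

def lc_energy(L, C, q, i):
    """Electromagnetic energy U = q^2 / (2C) + (1/2) L i^2."""
    return q**2 / (2.0 * C) + 0.5 * L * i**2

# Hypothetical mechanical system (illustrative numbers only).
m, k = 0.5, 200.0      # kg, N/m
# Electrical analogue from Table 7.1: L <-> m and 1/C <-> k.
L, C = m, 1.0 / k      # H, F

print("omega0 (spring):", spring_omega0(m, k))
print("omega0 (LC)    :", lc_omega0(L, C))
# With x <-> q and v <-> i, the same numbers give the same energy.
print("E (spring):", spring_energy(m, k, 0.1, 2.0))
print("U (LC)    :", lc_energy(L, C, 0.1, 2.0))
```

The two frequencies and the two energies agree because the equations differ
only in the names of the symbols.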


Note that the above discussion of LC oscillations is not realistic for two 
reasons: 

(i) Every inductor has some resistance. The effect of this resistance is to 
introduce a damping effect on the charge and current in the circuit and 
the oscillations finally die away. 








(ii) Even if the resistance were zero, the total energy of the system would 
not remain constant. It is radiated away from the system in the form of 
electromagnetic waves (discussed in the next chapter). In fact, radio and 
TV transmitters depend on this radiation. 


Two different phenomena, same mathematical treatment 

You may like to compare the treatment of a forced damped oscillator 
discussed in Section 14.10 of Class XI physics textbook, with that of 
an LCR circuit when an ac voltage is applied in it. We have already 
remarked that Eq. [14.37(b)] of Class XI Textbook is exactly similar to 
Eq. (7.28) here, although they use different symbols and parameters. 
Let us therefore list the equivalence between different quantities in 
the two situations: 


Forced oscillations                         Driven LCR circuit

$m\frac{d^2x}{dt^2} + b\frac{dx}{dt} + kx = F_0\cos\omega_d t$    $L\frac{d^2q}{dt^2} + R\frac{dq}{dt} + \frac{q}{C} = v_m\sin\omega t$

Displacement, x                             Charge on capacitor, q
Time, t                                     Time, t
Mass, m                                     Self inductance, L
Damping constant, b                         Resistance, R
Spring constant, k                          Inverse capacitance, 1/C
Driving frequency, $\omega_d$               Driving frequency, $\omega$
Natural frequency of oscillations, $\omega$ Natural frequency of LCR circuit, $\omega_0$
Amplitude of forced oscillations, A         Maximum charge stored, $q_m$
Amplitude of driving force, $F_0$           Amplitude of applied voltage, $v_m$

You must note that since x corresponds to q, the
amplitude A (maximum displacement) will correspond to the
maximum charge stored, $q_m$. Equation [14.39 (a)] of Class XI gives
the amplitude of oscillations in terms of other parameters, which we
reproduce here for convenience:

$$A = \frac{F_0}{\left\{m^2\left(\omega^2 - \omega_d^2\right)^2 + \omega_d^2\,b^2\right\}^{1/2}}$$

Replace each parameter in the above equation by the corresponding
electrical quantity, and see what happens. Eliminate L, C, $\omega$, and
$\omega_0$, using $X_L = \omega L$, $X_C = 1/\omega C$, and
$\omega_0^2 = 1/LC$. When you use Eqs. (7.33) and (7.34), you will see that
there is a perfect match.

You will come across numerous such situations in physics where 
diverse physical phenomena are represented by the same 
mathematical equation. If you have dealt with one of them, and you 
come across another situation, you may simply replace the 
corresponding quantities and interpret the result in the new context. 
We suggest that you may try to find more such parallel situations 
from different areas of physics. One must, of course, be aware of the 
differences too. 
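The substitution suggested in this box can be tried out numerically. The
Python sketch below assumes the standard amplitude of a forced damped
oscillator, A = F0 / {m²(ω² - ω_d²)² + ω_d² b²}^(1/2), and compares it with
the charge amplitude q_m = v_m/(ωZ) of a driven series LCR circuit, where
Z = {R² + (X_C - X_L)²}^(1/2). The numerical values and function names are
hypothetical and serve only as an illustration.

```python
import math

def forced_amplitude(F0, m, b, k, omega_d):
    """Amplitude of a forced damped oscillator driven at angular frequency omega_d."""
    omega_nat_sq = k / m   # square of the natural angular frequency
    return F0 / math.sqrt(m**2 * (omega_nat_sq - omega_d**2)**2 + (omega_d * b)**2)

def lcr_charge_amplitude(v_m, L, R, C, omega):
    """Maximum charge q_m = i_m / omega = v_m / (omega * Z) in a driven series LCR circuit."""
    X_L = omega * L
    X_C = 1.0 / (omega * C)
    Z = math.sqrt(R**2 + (X_C - X_L)**2)
    return v_m / (omega * Z)

# Hypothetical circuit values (illustrative only).
v_m, L, R, C, omega = 10.0, 0.1, 5.0, 1.0e-4, 300.0

# Substitution from the table: m -> L, b -> R, k -> 1/C, F0 -> v_m, omega_d -> omega.
A_mapped = forced_amplitude(F0=v_m, m=L, b=R, k=1.0 / C, omega_d=omega)
q_m = lcr_charge_amplitude(v_m, L, R, C, omega)

print("A with electrical substitutions:", A_mapped)
print("q_m from impedance             :", q_m)
```

Both routes give the same number, which is the "perfect match" referred to
above, here checked for one arbitrary set of values rather than proved in
general.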


Example 7.11 Show that in the free oscillations of an LC circuit, the
sum of energies stored in the capacitor and the inductor is constant in
time.


Solution Let $q_0$ be the initial charge on a capacitor. Let the charged
capacitor be connected to an inductor of inductance L. As you have
studied in Section 7.8, this LC circuit will sustain an oscillation with
frequency

$$\omega\,(= 2\pi\nu) = \frac{1}{\sqrt{LC}}$$

At an instant t, charge q on the capacitor and the current i are given
by:

$$q(t) = q_0 \cos\omega t$$

$$i(t) = -q_0\,\omega \sin\omega t$$

Energy stored in the capacitor at time t is

$$U_E = \frac{1}{2}CV^2 = \frac{1}{2}\frac{q^2}{C} = \frac{q_0^2}{2C}\cos^2(\omega t)$$

Energy stored in the inductor at time t is

$$U_M = \frac{1}{2}Li^2 = \frac{1}{2}L\,q_0^2\,\omega^2\sin^2(\omega t) = \frac{q_0^2}{2C}\sin^2(\omega t) \qquad \left(\because\ \omega = 1/\sqrt{LC}\right)$$

Sum of energies

$$U_E + U_M = \frac{q_0^2}{2C}\left[\cos^2\omega t + \sin^2\omega t\right] = \frac{q_0^2}{2C}$$

This sum is constant in time, as both $q_0$ and C are time-independent.
Note that it is equal to the initial energy of the capacitor. Why is it so?
Think!
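The result of Example 7.11 can also be verified numerically. The following
Python sketch evaluates the two energies at a few instants of one cycle; the
values of L, C and q0 and the name lc_energies are hypothetical and chosen
only for illustration.

```python
import math

def lc_energies(L, C, q0, t):
    """Return (U_E, U_M) at time t for q(t) = q0 cos(wt), i(t) = -q0 w sin(wt)."""
    omega = 1.0 / math.sqrt(L * C)
    q = q0 * math.cos(omega * t)
    i = -q0 * omega * math.sin(omega * t)
    U_E = q**2 / (2.0 * C)      # energy stored in the capacitor
    U_M = 0.5 * L * i**2        # energy stored in the inductor
    return U_E, U_M

# Hypothetical values (illustrative only).
L, C, q0 = 2.0e-3, 5.0e-6, 1.0e-5   # H, F, C

U_initial = q0**2 / (2.0 * C)        # initial energy of the capacitor
period = 2.0 * math.pi * math.sqrt(L * C)

for n in range(5):
    t = n * period / 8.0
    U_E, U_M = lc_energies(L, C, q0, t)
    print(f"t = {n}/8 T: U_E = {U_E:.3e} J, U_M = {U_M:.3e} J, sum = {U_E + U_M:.3e} J")
```

The individual energies change from instant to instant, but their sum stays
equal to the initial energy $q_0^2/2C$ of the capacitor.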


7.9 Transformers 

For many purposes, it is necessary to change (or transform) an
alternating voltage from one value to another of greater or smaller value.
This is done with a device called a transformer, using the principle of
mutual induction.


A transformer consists of two sets of coils, insulated from each other.
They are wound on a soft-iron core, either one on top of the other as in
Fig. 7.20(a) or on separate limbs of the core as in Fig. 7.20(b). One of the
coils called the primary coil has $N_p$ turns. The other coil is called the
secondary coil; it has $N_s$ turns. Often the primary coil is the input coil
and the secondary coil is the output coil of the transformer.




Figure 7.20 Two arrangements for winding of primary and secondary coil in a transformer: 
(a) two coils on top of each other, (b) two coils on separate limbs of the core. 


When an alternating voltage is applied to the primary, the resulting
current produces an alternating magnetic flux which links the secondary
and induces an emf in it. The value of this emf depends on the number of
turns in the secondary. We consider an ideal transformer in which the
primary has negligible resistance and all the flux in the core links both
primary and secondary windings. Let $\phi$ be the flux in each turn in the
core at time t due to current in the primary when a voltage $v_p$ is applied
to it.

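Then the induced emf or voltage $\varepsilon_s$, in the secondary with $N_s$
turns is

$$\varepsilon_s = -N_s\frac{d\phi}{dt} \qquad (7.45)$$

The alternating flux $\phi$ also induces an emf, called back emf, in the
primary. This is

$$\varepsilon_p = -N_p\frac{d\phi}{dt} \qquad (7.46)$$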


But $\varepsilon_p = v_p$. If this were not so, the primary current would be
infinite since the primary has zero resistance (as assumed). If the secondary
is an open circuit or the current taken from it is small, then to a good
approximation

$$\varepsilon_s = v_s$$

where $v_s$ is the voltage across the secondary. Therefore, Eqs. (7.45) and
(7.46) can be written as

$$v_s = -N_s\frac{d\phi}{dt} \qquad [7.45(a)]$$

$$v_p = -N_p\frac{d\phi}{dt} \qquad [7.46(a)]$$

From Eqs. [7.45 (a)] and [7.46 (a)], we have

$$\frac{v_s}{v_p} = \frac{N_s}{N_p} \qquad (7.47)$$
Note that the above relation has been obtained using three assumptions: 
(i) the primary resistance and current are small; (ii) the same flux links 
both the primary and the secondary as very little flux escapes from the 
core, and (iii) the secondary current is small. 

If the transformer is assumed to be 100% efficient (no energy losses), the
power input is equal to the power output, and since p = i v,

$$i_p v_p = i_s v_s \qquad (7.48)$$


Although some energy is always lost, this is a good approximation, since
a well designed transformer may have an efficiency of more than 95%.
Combining Eqs. (7.47) and (7.48), we have

$$\frac{i_p}{i_s} = \frac{v_s}{v_p} = \frac{N_s}{N_p} \qquad (7.49)$$


Since i and v both oscillate with the same frequency as the ac source,
Eq. (7.49) also gives the ratio of the amplitudes or rms values of
corresponding quantities.

Now, we can see how a transformer affects the voltage and current. We
have:

$$V_s = \left(\frac{N_s}{N_p}\right)V_p \quad \text{and} \quad I_s = \left(\frac{N_p}{N_s}\right)I_p \qquad (7.50)$$


That is, if the secondary coil has a greater number of turns than the
primary ($N_s > N_p$), the voltage is stepped up ($V_s > V_p$). This type of
arrangement is called a step-up transformer. However, in this
arrangement, there is less current in the secondary than in the primary
($N_p/N_s < 1$ and $I_s < I_p$). For example, if the primary coil of a
transformer has 100 turns and the secondary has 200 turns, $N_s/N_p = 2$ and
$N_p/N_s = 1/2$. Thus, a 220 V input at 10 A will step-up to 440 V output at
5.0 A.

If the secondary coil has fewer turns than the primary ($N_s < N_p$), we have
a step-down transformer. In this case, $V_s < V_p$ and $I_s > I_p$. That is,
the voltage is stepped down, or reduced, and the current is increased.
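Equation (7.50) is easy to turn into a small calculation. The Python sketch
below is a minimal illustration for an ideal (lossless) transformer; the
function name ideal_transformer is hypothetical. The step-up call uses the
numbers of the example in the text, while the step-down call simply swaps the
turns as an assumed second case.

```python
def ideal_transformer(V_p, I_p, N_p, N_s):
    """Secondary voltage and current of an ideal transformer, Eq. (7.50)."""
    V_s = (N_s / N_p) * V_p
    I_s = (N_p / N_s) * I_p
    return V_s, I_s

# Step-up case from the text: 100 primary turns, 200 secondary turns.
V_s, I_s = ideal_transformer(V_p=220.0, I_p=10.0, N_p=100, N_s=200)
print("step-up  :", V_s, "V,", I_s, "A")

# Step-down case (assumed for illustration): turns swapped.
V_d, I_d = ideal_transformer(V_p=220.0, I_p=10.0, N_p=200, N_s=100)
print("step-down:", V_d, "V,", I_d, "A")
```

In both cases the product of voltage and current is the same on the two
sides, which is the statement of Eq. (7.48) for a transformer without
losses.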

The equations obtained above apply to ideal transformers (without any 
energy losses). But in actual transformers, small energy losses do occur 
due to the following reasons: 

(i) Flux Leakage: There is always some flux leakage; that is, not all of the 
flux due to primary passes through the secondary due to poor design of 
the core or the air gaps in the core. It can be reduced by winding the 
primary and secondary coils one over the other. 

(ii) Resistance of the windings: The wire used for the windings has some
resistance and so, energy is lost due to heat produced in the wire
($I^2R$). In high current, low voltage windings, these are minimised by
using thick wire.

(iii) Eddy currents: The alternating magnetic flux induces eddy currents in 
the iron core and causes heating. The effect is reduced by using a 
laminated core. 

(iv) Hysteresis: The magnetisation of the core is repeatedly reversed by 
the alternating magnetic field. The resulting expenditure of energy in the 
core appears as heat and is kept to a minimum by using a magnetic 
material which has a low hysteresis loss. 

The large scale transmission and distribution of electrical energy over
long distances is done with the use of transformers. The voltage output of
the generator is stepped-up (so that current is reduced and consequently,
the $I^2R$ loss is cut down). It is then transmitted over long distances to an
area sub-station near the consumers. There the voltage is stepped down.
It is further stepped down at distributing sub-stations and utility poles
before a power supply of 240 V reaches our homes.
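The benefit of stepping up the voltage before transmission can be seen from a
rough estimate. The Python sketch below assumes a simple model in which a
power P is delivered at line voltage V through a line of total resistance R,
so that the line current is I = P/V and the heat loss is I²R. The numbers and
the function name line_loss are hypothetical, and effects such as the power
factor and transformer losses are ignored.

```python
def line_loss(P_delivered, V_line, R_line):
    """Heat loss I^2 R in a line carrying power P_delivered at voltage V_line."""
    I = P_delivered / V_line      # line current for the given power and voltage
    return I**2 * R_line

# Hypothetical figures (illustrative only).
P = 100.0e3     # power to be delivered, in watt
R = 2.0         # total resistance of the line, in ohm

loss_low = line_loss(P, 2.0e3, R)     # transmission at 2 kV
loss_high = line_loss(P, 20.0e3, R)   # transmission at 20 kV

print("loss at  2 kV:", loss_low, "W")
print("loss at 20 kV:", loss_high, "W")
print("ratio        :", loss_low / loss_high)
```

Raising the line voltage by a factor of 10 lowers the current by the same
factor and the $I^2R$ loss by a factor of 100 in this simple model.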


Summary 

1. An alternating voltage $v = v_m \sin\omega t$ applied to a resistor R
drives a current $i = i_m \sin\omega t$ in the resistor, $i_m = v_m/R$. The
current is in phase with the applied voltage.

2. For an alternating current $i = i_m \sin\omega t$ passing through a
resistor R, the average power loss P (averaged over a cycle) due to joule
heating is $(1/2)\,i_m^2 R$. To express it in the same form as the dc power
($P = I^2R$), a special value of current is used. It is called root mean
square (rms) current and is denoted by I:

$$I = \frac{i_m}{\sqrt{2}} = 0.707\,i_m$$

Similarly, the rms voltage is defined by

$$V = \frac{v_m}{\sqrt{2}} = 0.707\,v_m$$

We have $P = IV = I^2R$.

3. An ac voltage $v = v_m \sin\omega t$ applied to a pure inductor L, drives a
current in the inductor $i = i_m \sin(\omega t - \pi/2)$, where
$i_m = v_m/X_L$. $X_L = \omega L$ is called inductive reactance. The current
in the inductor lags the voltage by $\pi/2$. The average power supplied to an
inductor over one complete cycle is zero.

4. An ac voltage $v = v_m \sin\omega t$ applied to a capacitor drives a
current in the capacitor: $i = i_m \sin(\omega t + \pi/2)$. Here,
$i_m = v_m/X_C$, and $X_C = 1/\omega C$ is called capacitive reactance.
The current through the capacitor is $\pi/2$ ahead of the applied voltage.
As in the case of inductor, the average power supplied to a capacitor
over one complete cycle is zero.

5. For a series RLC circuit driven by voltage $v = v_m \sin\omega t$, the
current is given by $i = i_m \sin(\omega t + \phi)$

where $i_m = \dfrac{v_m}{\sqrt{R^2 + (X_C - X_L)^2}}$

and $\phi = \tan^{-1}\dfrac{X_C - X_L}{R}$

$Z = \sqrt{R^2 + (X_C - X_L)^2}$ is called the impedance of the circuit.

The average power loss over a complete cycle is given by
$P = V I \cos\phi$

The term $\cos\phi$ is called the power factor.

6. In a purely inductive or capacitive circuit, $\cos\phi = 0$ and no power is
dissipated even though a current is flowing in the circuit. In such
cases, current is referred to as a wattless current.

7. The phase relationship between current and voltage in an ac circuit
can be shown conveniently by representing voltage and current by
rotating vectors called phasors. A phasor is a vector which rotates
about the origin with angular speed $\omega$. The magnitude of a phasor
represents the amplitude or peak value of the quantity (voltage or
current) represented by the phasor.

The analysis of an ac circuit is facilitated by the use of a phasor
diagram.

8. An interesting characteristic of a series RLC circuit is the
phenomenon of resonance. The circuit exhibits resonance, i.e., the
amplitude of the current is maximum at the resonant frequency,
$\omega_0 = \dfrac{1}{\sqrt{LC}}$. The quality factor Q defined by

$$Q = \frac{\omega_0 L}{R} = \frac{1}{\omega_0 C R}$$

is an indicator of the sharpness of the resonance, the higher value of Q
indicating sharper peak in the current.

9. A circuit containing an inductor L and a capacitor C (initially
charged) with no ac source and no resistors exhibits free oscillations.
The charge q of the capacitor satisfies the equation of simple
harmonic motion:

$$\frac{d^2 q}{dt^2} + \frac{1}{LC}\,q = 0$$

and therefore, the frequency $\omega$ of free oscillation is
$\omega_0 = \dfrac{1}{\sqrt{LC}}$. The energy in the system oscillates between
the capacitor and the inductor but their sum or the total energy is constant
in time.




10. A transformer consists of an iron core on which are wound a
primary coil of $N_p$ turns and a secondary coil of $N_s$ turns. If the
primary coil is connected to an ac source, the primary and secondary
voltages are related by

$$V_s = \left(\frac{N_s}{N_p}\right)V_p$$

and the currents are related by

$$I_s = \left(\frac{N_p}{N_s}\right)I_p$$

If the secondary coil has a greater number of turns than the primary,
the voltage is stepped-up ($V_s > V_p$). This type of arrangement is
called a step-up transformer. If the secondary coil has fewer turns than
the primary, we have a step-down transformer.
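Several of the relations collected in this summary (reactances, impedance,
phase angle, power factor, resonant frequency and quality factor) can be
evaluated together. The Python sketch below is one possible way of doing so;
the circuit values and the names series_rlc and resonance are hypothetical
and are used only for illustration.

```python
import math

def series_rlc(V_rms, R, L, C, omega):
    """Return (Z, phi, I_rms, P_avg) for a series RLC circuit at angular frequency omega."""
    X_L = omega * L                          # inductive reactance
    X_C = 1.0 / (omega * C)                  # capacitive reactance
    Z = math.sqrt(R**2 + (X_C - X_L)**2)     # impedance
    phi = math.atan2(X_C - X_L, R)           # phase angle, tan(phi) = (X_C - X_L)/R
    I_rms = V_rms / Z
    P_avg = V_rms * I_rms * math.cos(phi)    # average power, P = V I cos(phi)
    return Z, phi, I_rms, P_avg

def resonance(R, L, C):
    """Return (omega0, Q) for a series RLC circuit."""
    omega0 = 1.0 / math.sqrt(L * C)
    Q = omega0 * L / R
    return omega0, Q

# Hypothetical circuit (illustrative values only).
V_rms, R, L, C = 100.0, 10.0, 0.1, 1.0e-5

omega0, Q = resonance(R, L, C)
print(f"omega0 = {omega0:.1f} rad/s, Q = {Q:.2f}")

for omega in (0.5 * omega0, omega0, 2.0 * omega0):
    Z, phi, I_rms, P_avg = series_rlc(V_rms, R, L, C, omega)
    print(f"omega = {omega:7.1f}: Z = {Z:7.2f} ohm, "
          f"power factor = {math.cos(phi):.3f}, P = {P_avg:7.2f} W")
```

At the resonant frequency the impedance reduces to R, the power factor is 1
and the average power is largest; away from resonance the power factor and
the power both fall.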









Physical quantity     Symbol                      Dimensions                  Unit     Remarks

rms voltage           V                           $[M L^2 T^{-3} A^{-1}]$     V        $V = v_m/\sqrt{2}$, $v_m$ is the amplitude of the ac voltage.
rms current           I                           $[A]$                       A        $I = i_m/\sqrt{2}$, $i_m$ is the amplitude of the ac current.
Reactance:
  Inductive           $X_L$                       $[M L^2 T^{-3} A^{-2}]$     Ω        $X_L = \omega L$
  Capacitive          $X_C$                       $[M L^2 T^{-3} A^{-2}]$     Ω        $X_C = 1/\omega C$
Impedance             Z                           $[M L^2 T^{-3} A^{-2}]$     Ω        Depends on elements present in the circuit.
Resonant frequency    $\omega_r$ or $\omega_0$    $[T^{-1}]$                  Hz       $\omega_0 = 1/\sqrt{LC}$ for a series RLC circuit.
Quality factor        Q                           Dimensionless                        $Q = \omega_0 L/R = 1/(\omega_0 C R)$ for a series RLC circuit.
Power factor                                      Dimensionless                        $= \cos\phi$, $\phi$ is the phase difference between voltage applied and current in the circuit.


Points to Ponder 

1. When a value is given for ac voltage or current, it is ordinarily the
rms value. The voltage across the terminals of an outlet in your room
is normally 240 V. This refers to the rms value of the voltage. The
amplitude of this voltage is

$$v_m = \sqrt{2}\,V = \sqrt{2}\,(240) = 340\ \text{V}$$

2. The power rating of an element used in ac circuits refers to its 
average power rating. 








3. The power consumed in an ac circuit is never negative. 

4. Both alternating current and direct current are measured in 
amperes. But how is the ampere defined for an alternating current? It 
cannot be derived from the mutual attraction of two parallel wires 
carrying ac currents, as the dc ampere is derived. An ac current 
changes direction with the source frequency and the attractive force 
would average to zero. Thus, the ac ampere must be defined in terms 
of some property that is independent of the direction of the current. 
Joule heating is such a property, and there is one ampere of rms 
value of alternating current in a circuit if the current produces the 
same average heating effect as one ampere of dc current would 
produce under the same conditions. 

5. In an ac circuit, while adding voltages across different elements,
one should take care of their phases properly. For example, if $V_R$ and
$V_C$ are voltages across R and C, respectively in an RC circuit, then
the total voltage across RC combination is
$V_{RC} = \sqrt{V_R^2 + V_C^2}$ and not $V_R + V_C$, since $V_C$ is $\pi/2$
out of phase with $V_R$.

6. Though in a phasor diagram, voltage and current are represented 
by vectors, these quantities are not really vectors themselves. They 
are scalar quantities. It so happens that the amplitudes and phases of 
harmonically varying scalars combine mathematically in the same 
way as do the projections of rotating vectors of corresponding 
magnitudes and directions. The ‘rotating vectors’ that represent 
harmonically varying scalar quantities are introduced only to provide 
us with a simple way of adding these quantities using a rule that we 
already know as the law of vector addition. 

7. There are no power losses associated with pure capacitances and
pure inductances in an ac circuit. The only element that dissipates
energy in an ac circuit is the resistive element.

8. In an RLC circuit, the resonance phenomenon occurs when $X_L = X_C$ or
$\omega_0 = \dfrac{1}{\sqrt{LC}}$. For resonance to occur, the presence of
both L and C elements in the circuit is a must. With only one of these (L or
C) elements, there is no possibility of voltage cancellation and hence, no
resonance is possible.

9. The power factor in a RLC circuit is a measure of how close the 
circuit is to expending the maximum power. 

10. In generators and motors, the roles of input and output are 
reversed. In a motor, electric energy is the input and mechanical 
energy is the output. In a generator, mechanical energy is the input 
and electric energy is the output. Both devices simply transform 
energy from one form to another. 

11. A transformer (step-up) changes a low-voltage into a high-voltage. 
This does not violate the law of conservation of energy. The current is 
reduced by the same proportion. 

12. The choice of whether the description of an oscillatory motion is 
by means of sines or cosines or by their linear combinations is 
unimportant, since changing the zero-time position transforms the 
one to the other. 
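The remarks on adding voltages with due regard to phase (points 5 and 6) can
be illustrated with a short Python sketch. It assumes hypothetical rms values
for $V_R$ and $V_C$ in a series RC circuit, adds them once as phasors (using
a complex number, with $V_C$ lagging $V_R$ by π/2) and once by sampling the
instantaneous voltages over a cycle. The function name rms_of_sum is
illustrative only.

```python
import math

def rms_of_sum(V_R, V_C, samples=1000):
    """rms of v_R(t) + v_C(t) over one cycle, with v_C lagging v_R by pi/2."""
    total = 0.0
    for n in range(samples):
        theta = 2.0 * math.pi * n / samples
        v_R = math.sqrt(2.0) * V_R * math.sin(theta)
        v_C = math.sqrt(2.0) * V_C * math.sin(theta - math.pi / 2.0)
        total += (v_R + v_C) ** 2            # instantaneous voltages add algebraically
    return math.sqrt(total / samples)

# Hypothetical rms voltages across R and C (illustrative only).
V_R, V_C = 30.0, 40.0

V_phasor = abs(complex(V_R, -V_C))   # phasor sum: V_C is drawn pi/2 behind V_R
V_sampled = rms_of_sum(V_R, V_C)

print("phasor sum        :", V_phasor)
print("rms of sampled sum:", V_sampled)
print("naive sum V_R+V_C :", V_R + V_C)
```

The phasor sum and the rms of the sampled instantaneous sum agree with
$\sqrt{V_R^2 + V_C^2}$, whereas simply adding the two rms values gives a
larger and incorrect result.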


Exercises 

7.1 A 100 Ω resistor is connected to a 220 V, 50 Hz ac supply.

(a) What is the rms value of current in the circuit?

(b) What is the net power consumed over a full cycle?

7.2 (a) The peak voltage of an ac supply is 300 V. What is the rms 
voltage? 

(b) The rms value of current in an ac circuit is 10 A. What is the peak 
current? 

7.3 A 44 mH inductor is connected to 220 V, 50 Hz ac supply. 
Determine the rms value of the current in the circuit. 

7.4 A 60 μF capacitor is connected to a 110 V, 60 Hz ac supply.
Determine the rms value of the current in the circuit.

7.5 In Exercises 7.3 and 7.4, what is the net power absorbed by each 
circuit over a complete cycle. Explain your answer. 

7.6 Obtain the resonant frequency $\omega_r$ of a series LCR circuit with
L = 2.0 H, C = 32 μF and R = 10 Ω. What is the Q-value of this circuit?

7.7 A charged 30 μF capacitor is connected to a 27 mH inductor.
What is the angular frequency of free oscillations of the circuit?

7.8 Suppose the initial charge on the capacitor in Exercise 7.7 is 6 
mC. What is the total energy stored in the circuit initially? What is the 
total energy at later time? 

7.9 A series LCR circuit with R = 20 Ω, L = 1.5 H and C = 35 μF is
connected to a variable-frequency 200 V ac supply. When the
frequency of the supply equals the natural frequency of the circuit,
what is the average power transferred to the circuit in one complete
cycle?

7.10 A radio can tune over the frequency range of a portion of MW
broadcast band: (800 kHz to 1200 kHz). If its LC circuit has an
effective inductance of 200 μH, what must be the range of its variable
capacitor?

[Hint: For tuning, the natural frequency i.e., the frequency of free
oscillations of the LC circuit should be equal to the frequency of the
radiowave.]

7.11 Figure 7.21 shows a series LCR circuit connected to a variable
frequency 230 V source. L = 5.0 H, C = 80 μF, R = 40 Ω.

Figure 7.21

(a) Determine the source frequency which drives the circuit in 
resonance. 

(b) Obtain the impedance of the circuit and the amplitude of current at 
the resonating frequency. 

(c) Determine the rms potential drops across the three elements of 
the circuit. Show that the potential drop across the LC combination is 
zero at the resonating frequency. 


Additional Exercises 

7.12 An LC circuit contains a 20 mH inductor and a 50 μF capacitor
with an initial charge of 10 mC. The resistance of the circuit is
negligible. Let the instant the circuit is closed be t = 0.















(a) What is the total energy stored initially? Is it conserved during LC 
oscillations? 

(b) What is the natural frequency of the circuit? 

(c) At what time is the energy stored 

(i) completely electrical (i.e., stored in the capacitor)? (ii) completely 
magnetic (i.e., stored in the inductor)? 

(d) At what times is the total energy shared equally between the 
inductor and the capacitor? 

(e) If a resistor is inserted in the circuit, how much energy is 
eventually dissipated as heat? 

7.13 A coil of inductance 0.50 H and resistance 100 Ω is connected
to a 240 V, 50 Hz ac supply.

(a) What is the maximum current in the coil? 

(b) What is the time lag between the voltage maximum and the 
current maximum? 

7.14 Obtain the answers to (a) and (b) in Exercise 7.13 if the circuit is
connected to a high frequency supply (240 V, 10 kHz). Hence,
explain the statement that at very high frequency, an inductor in a
circuit nearly amounts to an open circuit. How does an inductor
behave in a dc circuit after the steady state?

7.15 A 100 μF capacitor in series with a 40 Ω resistance is connected
to a 110 V, 60 Hz supply.

(a) What is the maximum current in the circuit? 

(b) What is the time lag between the current maximum and the 
voltage maximum? 




7.16 Obtain the answers to (a) and (b) in Exercise 7.15 if the circuit is
connected to a 110 V, 12 kHz supply. Hence, explain the statement
that a capacitor is a conductor at very high frequencies. Compare this
behaviour with that of a capacitor in a dc circuit after the steady state.

7.17 Keeping the source frequency equal to the resonating frequency 
of the series LCR circuit, if the three elements, L, C and R are 
arranged in parallel, show that the total current in the parallel LCR 
circuit is minimum at this frequency. Obtain the current rms value in 
each branch of the circuit for the elements and source specified in 
Exercise 7.11 for this frequency. 

7.18 A circuit containing an 80 mH inductor and a 60 μF capacitor in
series is connected to a 230 V, 50 Hz supply. The resistance of the
circuit is negligible.

(a) Obtain the current amplitude and rms values. 

(b) Obtain the rms values of potential drops across each element. 

(c) What is the average power transferred to the inductor? 

(d) What is the average power transferred to the capacitor? 

(e) What is the total average power absorbed by the circuit? 
[‘Average’ implies ‘averaged over one cycle’.] 

7.19 Suppose the circuit in Exercise 7.18 has a resistance of 15 Ω.
Obtain the average power transferred to each element of the circuit,
and the total power absorbed.

7.20 A series LCR circuit with L = 0.12 H, C = 480 nF, R = 23 Ω is
connected to a 230 V variable frequency supply.

(a) What is the source frequency for which current amplitude is
maximum? Obtain this maximum value.

(b) What is the source frequency for which average power absorbed
by the circuit is maximum? Obtain the value of this maximum power.

(c) For which frequencies of the source is the power transferred to 
the circuit half the power at resonant frequency? What is the current 
amplitude at these frequencies? 

(d) What is the Q-factor of the given circuit? 

7.21 Obtain the resonant frequency and Q-factor of a series LCR
circuit with L = 3.0 H, C = 27 μF, and R = 7.4 Ω. It is desired to
improve the sharpness of the resonance of the circuit by reducing its
‘full width at half maximum’ by a factor of 2. Suggest a suitable way.

7.22 Answer the following questions: 

(a) In any ac circuit, is the applied instantaneous voltage equal to the 
algebraic sum of the instantaneous voltages across the series 
elements of the circuit? Is the same true for rms voltage? 

(b) A capacitor is used in the primary circuit of an induction coil. 

(c) An applied voltage signal consists of a superposition of a dc 
voltage and an ac voltage of high frequency. The circuit consists of an 
inductor and a capacitor in series. Show that the dc signal will appear 
across C and the ac signal across L. 

(d) A choke coil in series with a lamp is connected to a dc line. The 
lamp is seen to shine brightly. Insertion of an iron core in the choke 
causes no change in the lamp’s brightness. Predict the 
corresponding observations if the connection is to an ac line. 

(e) Why is choke coil needed in the use of fluorescent tubes with ac 
mains? Why can we not use an ordinary resistor instead of the choke 
coil? 




7.23 A power transmission line feeds input power at 2300 V to a step- 
down transformer with its primary windings having 4000 turns. What 
should be the number of turns in the secondary in order to get output 
power at 230 V? 

7.24 At a hydroelectric power plant, the water pressure head is at a
height of 300 m and the water flow available is 100 m³ s⁻¹. If the
turbine generator efficiency is 60%, estimate the electric power
available from the plant (g = 9.8 m s⁻²).

7.25 A small town with a demand of 800 kW of electric power at 220
V is situated 15 km away from an electric plant generating power at
440 V. The resistance of the two wire line carrying power is 0.5 Ω per
km. The town gets power from the line through a 4000-220 V step-
down transformer at a sub-station in the town.

(a) Estimate the line power loss in the form of heat. 

(b) How much power must the plant supply, assuming there is 
negligible power loss due to leakage? 

(c) Characterise the step up transformer at the plant. 

7.26 Do the same exercise as above with the replacement of the
earlier transformer by a 40,000-220 V step-down transformer
(Neglect, as before, leakage losses though this may not be a good
assumption any longer because of the very high voltage transmission
involved). Hence, explain why high voltage transmission is preferred.


Chapter Eight


Electromagnetic Waves 


8.1 Introduction 

In Chapter 4, we learnt that an electric current produces a magnetic field
and that two current-carrying wires exert a magnetic force on each other.
Further, in Chapter 6, we have seen that a magnetic field changing with
time gives rise to an electric field. Is the converse also true? Does an
electric field changing with time give rise to a magnetic field? James
Clerk Maxwell (1831-1879) argued that this was indeed the case: not
only an electric current but also a time-varying electric field generates a
magnetic field. While applying Ampere’s circuital law to find the magnetic
field at a point outside a capacitor connected to a time-varying current,
Maxwell noticed an inconsistency in the law. To remove this inconsistency,
he suggested the existence of an additional current, which he called the
displacement current.

Maxwell formulated a set of equations involving electric and magnetic 
fields, and their sources, the charge and current densities. These 
equations are known as Maxwell’s equations. Together with the Lorentz 
force formula (Chapter 4), they mathematically express all the basic laws 
of electromagnetism. 

The most important prediction to emerge from Maxwell’s equations is the
existence of electromagnetic waves, which are (coupled) time-varying
electric and magnetic fields that propagate in space. The speed of the
waves, according to these equations, turned out to be very close to the
speed of light ($3 \times 10^8$ m/s), obtained from optical measurements.
This led to the remarkable conclusion that light is an electromagnetic wave.
Maxwell’s work thus unified the domain of electricity, magnetism and
light. Hertz, in 1887, experimentally demonstrated the existence of
electromagnetic waves. Their technological use by Marconi and others led
in due course to the revolution in communication that we are witnessing
today.



James Clerk Maxwell (1831-1879), born in Edinburgh, Scotland,
was among the greatest physicists of the nineteenth century. He
derived the thermal velocity distribution of molecules in a gas and
was among the first to obtain reliable estimates of molecular
parameters from measurable quantities like viscosity, etc. Maxwell’s
greatest achievement was the unification of the laws of electricity and
magnetism (discovered by Coulomb, Oersted, Ampere and Faraday)
into a consistent set of equations now called Maxwell’s equations.
From these he arrived at the most important conclusion that light is
an electromagnetic wave. Interestingly, Maxwell did not agree with
the idea (strongly suggested by Faraday’s laws of electrolysis)
that electricity was particulate in nature.

In this chapter, we first discuss the need for displacement current and its
consequences. Then we present a descriptive account of
electromagnetic waves. The broad spectrum of electromagnetic waves,
stretching from γ rays (wavelength ~$10^{-12}$ m) to long radio waves
(wavelength ~$10^{6}$ m) is described. How the electromagnetic waves are
sent and received for communication is discussed in Chapter 15.

8.2 Displacement Current 

We have seen in Chapter 4 that an electrical current produces a 
magnetic field around it. Maxwell showed that for logical consistency, a 
changing electric field must also produce a magnetic field. This effect is 
of great importance because it explains the existence of radio waves, 
gamma rays and visible light, as well as all other forms of 
electromagnetic waves. 

To see how a changing electric field gives rise to a magnetic field, let us
consider the process of charging of a capacitor and apply Ampere’s
circuital law given by (Chapter 4)

$$\oint \mathbf{B}\cdot d\mathbf{l} = \mu_0\, i(t) \qquad (8.1)$$

to find magnetic field at a point outside the capacitor.

Figure 8.1(a) shows a parallel plate capacitor C which is a part of circuit 
through which a time-dependent current i (t) flows . Let us find the 
magnetic field at a point such as P, in a region outside the parallel plate 
capacitor. For this, we consider a plane circular loop of radius r whose 
plane is perpendicular to the direction of the current-carrying wire, and 
which is centred symmetrically with respect to the wire [Fig. 8.1(a)]. From 
symmetry, the magnetic field is directed along the circumference of the 
circular loop and is the same in magnitude at all points on the loop so 
that if B is the magnitude of the field, the left side of Eq. (8.1) is B(2πr). 
So we have 

$B\,(2\pi r) = \mu_0\, i(t)$ (8.2) 

Now, consider a different surface, which has the same boundary. This is 
a pot-like surface [Fig. 8.1(b)] which nowhere touches the current, but 
has its bottom between the capacitor plates; its mouth is the circular loop 
mentioned above. Another such surface is shaped like a tiffin box 
(without the lid) [Fig. 8.1(c)]. On applying Ampere’s circuital law to such 
surfaces with the same perimeter, we find that the left hand side of Eq. 
(8.1) has not changed but the right hand side is zero and not $\mu_0 i$, since 
no current passes through the surface of Fig. 8.1(b) and (c). So we have 
a contradiction; calculated one way, there is a magnetic field at a point P; 
calculated another way, the magnetic field at P is zero. 

Since the contradiction arises from our use of Ampere’s circuital law, this 
law must be missing something. The missing term must be such that one 
gets the same magnetic field at point P, no matter what surface is used. 

Figure 8.1 A parallel plate capacitor C, as part of a circuit through which a time dependent 
current i(t) flows, (a) a loop of radius r, to determine magnetic field at a point P on the loop; (b) a 
pot-shaped surface passing through the interior between the capacitor plates with the loop shown 
in (a) as its rim; (c) a tiffin-shaped surface with the circular loop as its rim and a flat circular 
bottom S between the capacitor plates. The arrows show uniform electric field between the 
capacitor plates. 


We can actually guess the missing term by looking carefully at Fig. 
8.1(c). Is there anything passing through the surface S between the 
plates of the capacitor? Yes, of course, the electric field! If the plates of 
the capacitor have an area A, and a total charge Q, the magnitude of the 
electric field E between the plates is $(Q/A)/\varepsilon_0$ (see Eq. 2.41). The field is 
perpendicular to the surface S of Fig. 8.1(c). It has the same magnitude 
over the area A of the capacitor plates, and vanishes outside it. So what 
is the electric flux $\Phi_E$ through the surface S? Using Gauss’s law, it is 

$\Phi_E = |\mathbf{E}|\,A = \dfrac{1}{\varepsilon_0}\dfrac{Q}{A}\,A = \dfrac{Q}{\varepsilon_0}$ (8.3) 


Now if the charge Q on the capacitor plates changes with time, there is a 
current i = (dQ/dt), so that using Eq. (8.3), we have 

$\dfrac{d\Phi_E}{dt} = \dfrac{d}{dt}\left(\dfrac{Q}{\varepsilon_0}\right) = \dfrac{1}{\varepsilon_0}\dfrac{dQ}{dt}$ 

This implies that for consistency, 

$\varepsilon_0 \dfrac{d\Phi_E}{dt} = i$ (8.4) 


This is the missing term in Ampere’s circuital law. If we generalise this 
law by adding to the total current carried by conductors through the 
surface, another term which is $\varepsilon_0$ times the rate of change of electric flux 
through the same surface, the total has the same value of current i for all 
surfaces. If this is done, there is no contradiction in the value of B 
obtained anywhere using the generalised Ampere’s law. B at the point P 
is non-zero no matter which surface is used for calculating it. B at a point 
P outside the plates [Fig. 8.1(a)] is the same as at a point M just inside, 
as it should be. The current carried by conductors due to flow of charges 
is called conduction current. The current, given by Eq. (8.4), is a new 
term, and is due to changing electric field (or electric displacement, an 
old term still used sometimes). It is, therefore, called displacement 
current or Maxwell’s displacement current. Figure 8.2 shows the electric 
and magnetic fields inside the parallel plate capacitor discussed above. 

The generalisation made by Maxwell then is the following. The source of 
a magnetic field is not just the conduction electric current due to flowing 
charges, but also the time rate of change of electric field. More precisely, 
the total current i is the sum of the conduction current denoted by $i_c$, and 
the displacement current denoted by $i_d$ (= $\varepsilon_0\,(d\Phi_E/dt)$). So we have 

$i = i_c + i_d = i_c + \varepsilon_0 \dfrac{d\Phi_E}{dt}$ (8.5) 




Figure 8.2 (a) The electric and magnetic fields E and B between the capacitor plates, at the point 
M. (b) A cross sectional view of Fig. (a). 

In explicit terms, this means that outside the capacitor plates, we have 
only conduction current $i_c = i$, and no displacement current, i.e., $i_d = 0$. 
On the other hand, inside the capacitor, there is no conduction current, 
i.e., $i_c = 0$, and there is only displacement current, so that $i_d = i$. 
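
The equality of the conduction current in the wires and the displacement current between the plates can be checked numerically. The following is only a minimal sketch under stated assumptions: an ideal parallel plate capacitor with a uniform field E = Q/(ε0 A), and illustrative values for the capacitance, voltage, time constant, plate area and sampling instant that are not taken from the text.

```python
import math

EPS0 = 8.854e-12  # permittivity of free space, F/m


def charge(t, c=1e-9, v=2.0, tau=1e-3):
    """Charge on the plates while charging through a resistor (illustrative values)."""
    return c * v * (1.0 - math.exp(-t / tau))


def electric_flux(t, area=1.0):
    """Flux through a surface between the plates: Phi_E = E * A = Q / eps0 (Eq. 8.3)."""
    e_field = charge(t) / (EPS0 * area)
    return e_field * area


def derivative(f, t, h=1e-7):
    """Central-difference estimate of df/dt."""
    return (f(t + h) - f(t - h)) / (2.0 * h)


t = 0.5e-3  # s, an arbitrary instant during charging (assumption)

conduction_current = derivative(charge, t)                  # i = dQ/dt in the wires
displacement_current = EPS0 * derivative(electric_flux, t)  # i_d = eps0 * dPhi_E/dt (Eq. 8.4)

print(f"conduction current   = {conduction_current:.4e} A")
print(f"displacement current = {displacement_current:.4e} A")
```

The two printed values agree to within the accuracy of the numerical derivative, which is the content of Eq. (8.4) for this idealised geometry.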

The generalised (and correct) Ampere’s circuital law has the same form 
as Eq. (8.1), with one difference: “the total current passing through any 
surface of which the closed loop is the perimeter” is the sum of the 
conduction current and the displacement current. The generalised law is 

$\oint \mathbf{B}\cdot d\mathbf{l} = \mu_0\, i_c + \mu_0 \varepsilon_0 \dfrac{d\Phi_E}{dt}$ (8.6) 

and is known as Ampere-Maxwell law. 

In all respects, the displacement current has the same physical effects as 
the conduction current. In some cases, for example, steady electric fields 
in a conducting wire, the displacement current may be zero since the 
electric field E does not change with time. In other cases, for example, 
the charging capacitor above, both conduction and displacement currents 
may be present in different regions of space. In most of the cases, they 
both may be present in the same region of space, as there exists no 
perfectly conducting or perfectly insulating medium. Most interestingly, 
there may be large regions of space where there is no conduction 
current, but there is only a displacement current due to time-varying 
electric fields. In such a region, we expect a magnetic field, though there 
is no (conduction) current source nearby! The prediction of such a 
displacement current can be verified experimentally. For example, a 
magnetic field (say at point M) between the plates of the capacitor in Fig. 
8.2(a) can be measured and is seen to be the same as that just outside 
(at P). 

The displacement current has (literally) far reaching consequences. One 
thing we immediately notice is that the laws of electricity and magnetism 
are now more symmetrical*. Faraday’s law of induction states that there 
is an induced emf equal to the rate of change of magnetic flux. Now, 
since the emf between two points 1 and 2 is the work done per unit 
charge in taking it from 1 to 2, the existence of an emf implies the 
existence of an electric field. So, we can rephrase Faraday’s law of 
electromagnetic induction by saying that a magnetic field, changing with 
time, gives rise to an electric field. Then, the fact that an electric field 
changing with time gives rise to a magnetic field, is the symmetrical 
counterpart, and is a consequence of the displacement current being a 
source of a magnetic field. Thus, time-dependent electric and magnetic 
fields give rise to each other! Faraday’s law of electromagnetic induction 
and Ampere-Maxwell law give a quantitative expression of this statement, 
with the current being the total current, as in Eq. (8.5). One very 
important consequence of this symmetry is the existence of 
electromagnetic waves, which we discuss qualitatively in the next 
section. 

* They are still not perfectly symmetrical; there are no known sources of 
magnetic field (magnetic monopoles) analogous to electric charges which 
are sources of electric field. 


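Maxwell’s equations 

1. $\oint \mathbf{E}\cdot d\mathbf{A} = Q/\varepsilon_0$ (Gauss’s Law for electricity) 
2. $\oint \mathbf{B}\cdot d\mathbf{A} = 0$ (Gauss’s Law for magnetism) 
3. $\oint \mathbf{E}\cdot d\mathbf{l} = -\dfrac{d\Phi_B}{dt}$ (Faraday’s Law) 
4. $\oint \mathbf{B}\cdot d\mathbf{l} = \mu_0\, i_c + \mu_0 \varepsilon_0 \dfrac{d\Phi_E}{dt}$ (Ampere-Maxwell Law) 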


Example 8.1 A parallel plate capacitor with circular plates of radius 1 
m has a capacitance of 1 nF. At t = 0, it is connected for charging in 
series with a resistor R = 1 MΩ across a 2 V battery (Fig. 8.3). 
Calculate the magnetic field at a point P, halfway between the centre 
and the periphery of the plates, after t = $10^{-3}$ s. (The charge on the 
capacitor at time t is q(t) = CV [1 - exp(-t/τ)], where the time 
constant τ is equal to CR.) 

Figure 8.3 

Solution The time constant of the CR circuit is τ = CR = $10^{-3}$ s. 
Then, we have 

q(t) = CV [1 - exp(-t/τ)] 

= $2 \times 10^{-9}\,[1 - \exp(-t/10^{-3})]$ 

The electric field in between the plates at time t is 

$E = \dfrac{q(t)}{\varepsilon_0 A} = \dfrac{q}{\pi \varepsilon_0}$; $A = \pi (1)^2\ \text{m}^2$ = area of the plates. 

Consider now a circular loop of radius (1/2) m parallel to the plates 
passing through P. The magnetic field B at all points on the loop is 
along the loop and of the same value. 

The flux $\Phi_E$ through this loop is 

$\Phi_E$ = E × area of the loop 

$= E \times \pi \times \left(\dfrac{1}{2}\right)^2 = \dfrac{\pi E}{4} = \dfrac{q}{4\varepsilon_0}$ 

The displacement current 

$i_d = \varepsilon_0 \dfrac{d\Phi_E}{dt} = \dfrac{1}{4}\dfrac{dq}{dt} = 0.5 \times 10^{-6} \exp(-1)$ 

at t = $10^{-3}$ s. Now, applying Ampere-Maxwell law to the loop, we get 

$B \times 2\pi \times \left(\dfrac{1}{2}\right) = \mu_0 (i_c + i_d) = \mu_0 (0 + i_d) = 0.5 \times 10^{-6}\, \mu_0 \exp(-1)$ 

or, B = $0.74 \times 10^{-13}$ T 
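
The arithmetic of Example 8.1 can be reproduced in a few lines. The sketch below uses only the numbers given in the example; the variable names are chosen here for illustration, and the fraction of the displacement current enclosed by the loop is taken as the ratio of loop area to plate area, which assumes a uniform field between the plates.

```python
import math

MU0 = 4.0e-7 * math.pi  # permeability of free space, T m/A

C = 1e-9            # capacitance, F
R = 1e6             # series resistance, ohm
V = 2.0             # battery voltage, V
PLATE_RADIUS = 1.0  # m
LOOP_RADIUS = 0.5   # m, loop through P halfway to the periphery
t = 1e-3            # s

tau = C * R                                    # time constant of the CR circuit
dq_dt = (C * V / tau) * math.exp(-t / tau)     # derivative of q(t) = CV[1 - exp(-t/tau)]

# Displacement current through the loop: the share of dq/dt set by the area ratio.
i_d = dq_dt * (LOOP_RADIUS / PLATE_RADIUS) ** 2

# Ampere-Maxwell law on the loop: B * 2*pi*r = mu0 * (i_c + i_d), with i_c = 0 between the plates.
B = MU0 * i_d / (2.0 * math.pi * LOOP_RADIUS)

print(f"i_d = {i_d:.3e} A")
print(f"B   = {B:.2e} T")
```

The computed field is about 0.74 × 10^-13 T, in agreement with the worked solution.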


8.3 Electromagnetic Waves 

8.3.1 Sources of electromagnetic waves 

How are electromagnetic waves produced? Neither stationary charges 
nor charges in uniform motion (steady currents) can be sources of 
electromagnetic waves. The former produces only electrostatic fields, 
while the latter produces magnetic fields that, however, do not vary with 
time. It is an important result of Maxwell’s theory that accelerated 
charges radiate electromagnetic waves. The proof of this basic result is 
beyond the scope of this book, but we can accept it on the basis of 
rough, qualitative reasoning. Consider a charge oscillating with some 
frequency. (An oscillating charge is an example of accelerating charge.) 
This produces an oscillating electric field in space, which produces an 
oscillating magnetic field, which in turn, is a source of oscillating electric 
field, and so on. The oscillating electric and magnetic fields thus 
regenerate each other, so to speak, as the wave propagates through the 
space. 

The frequency of the electromagnetic wave naturally equals the 
frequency of oscillation of the charge. The energy associated with the 
propagating wave comes at the expense of the energy of the source - 
the accelerated charge. 

From the preceding discussion, it might appear easy to test the prediction 
that light is an electromagnetic wave. We might think that all we needed 
to do was to set up an ac circuit in which the current oscillates at the 
frequency of visible light, say, yellow light. But, alas, that is not possible. 
The frequency of yellow light is about $6 \times 10^{14}$ Hz, while the frequency 
that we get even with modern electronic circuits is hardly about $10^{11}$ Hz. 
This is why the experimental demonstration of electromagnetic waves had 
to come in the low frequency region (the radio wave region), as in 
Hertz’s experiment (1887). 

Hertz’s successful experimental test of Maxwell’s theory created a 
sensation and sparked off other important works in this field. Two 
important achievements in this connection deserve mention. Seven years 
after Hertz, Jagdish Chandra Bose, working at Calcutta (now Kolkata), 
succeeded in producing and observing electromagnetic waves of much 
shorter wavelength (25 mm to 5 mm). His experiment, like that of Hertz, 
was confined to the laboratory. 

At around the same time, Guglielmo Marconi in Italy followed Hertz’s 
work and succeeded in transmitting electromagnetic waves over 
distances of many kilometres. Marconi’s experiment marks the beginning 
of the field of communication using electromagnetic waves. 

8.3.2 Nature of electromagnetic waves 

It can be shown from Maxwell’s equations that electric and magnetic 
fields in an electromagnetic wave are perpendicular to each other, and to 
the direction of propagation. It appears reasonable, say from our 
discussion of the displacement current. Consider Fig. 8.2. The electric 
field inside the plates of the capacitor is directed perpendicular to the 
plates. The magnetic field this gives rise to via the displacement current 
is along the perimeter of a circle parallel to the capacitor plates. So B and 
E are perpendicular in this case. This is a general feature. 



Heinrich Rudolf Hertz (1857 - 1894) German physicist who was the 
first to broadcast and receive radio waves. He produced electromagnetic 
waves, sent them through space, and measured their 
wavelength and speed. He showed that the nature of their vibration, 
reflection and refraction was the same as that of light and heat 
waves, establishing their identity for the first time. He also pioneered 
research on discharge of electricity through gases, and discovered 
the photoelectric effect. 

In Fig. 8.4, we show a typical example of a plane electromagnetic wave 
propagating along the z direction (the fields are shown as a function of 
the z coordinate, at a given time t). The electric field $E_x$ is along the x-axis, 
and varies sinusoidally with z, at a given time. The magnetic field $B_y$ 
is along the y-axis, and again varies sinusoidally with z. The electric and 
magnetic fields $E_x$ and $B_y$ are perpendicular to each other, and to the 
direction z of propagation. We can write $E_x$ and $B_y$ as follows: 

$E_x = E_0 \sin(kz - \omega t)$ [8.7(a)] 

$B_y = B_0 \sin(kz - \omega t)$ [8.7(b)] 

Here k is related to the wavelength λ of the wave by the usual equation 

$k = \dfrac{2\pi}{\lambda}$ (8.8) 


Figure 8.4 A linearly polarised electromagnetic wave, propagating in the z-direction with the 
oscillating electric field E along the x-direction and the oscillating magnetic field B along the 
y-direction. 

Simulate propagation of electromagnetic waves 

(i) http://www.amanogawa.com/waves.html 
(ii) http://www.phys.hawaii.edu/~teb/java/ntnujava/emWave/emWave.html 

Here ω is the angular frequency, and k is the magnitude of the wave vector (or 
propagation vector) $\mathbf{k}$, whose direction describes the direction of 
propagation of the wave. The speed of propagation of the wave is (ω/k). 
Using Eqs. [8.7(a) and (b)] for $E_x$ and $B_y$ and Maxwell’s equations, one 
finds that 

$\omega = ck$, where $c = \dfrac{1}{\sqrt{\mu_0 \varepsilon_0}}$ [8.9(a)] 

The relation ω = ck is the standard one for waves (see for example, 
Section 15.4 of class XI Physics textbook). This relation is often written in 
terms of frequency, ν (= ω/2π) and wavelength, λ (= 2π/k) as 

$2\pi\nu = c\left(\dfrac{2\pi}{\lambda}\right)$ or $\nu\lambda = c$ [8.9(b)] 

It is also seen from Maxwell’s equations that the magnitude of the electric 
and the magnetic fields in an electromagnetic wave are related as 

$B_0 = E_0/c$ (8.10) 
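
As a numerical illustration of the relations ω = ck, c = 1/√(μ0 ε0) and B0 = E0/c, the sketch below evaluates the speed of light from the two constants and then the fields of a plane wave of the form given in Eqs. [8.7(a) and (b)]. The amplitude, wavelength and sampling point are arbitrary values assumed only for this example.

```python
import math

MU0 = 4.0e-7 * math.pi   # permeability of free space, T m/A
EPS0 = 8.854187817e-12   # permittivity of free space, F/m

c = 1.0 / math.sqrt(MU0 * EPS0)  # speed of electromagnetic waves in vacuum


def plane_wave(z, t, e0, wavelength):
    """Return (E_x, B_y) of a plane wave travelling along +z, as in Eqs. 8.7(a), 8.7(b)."""
    k = 2.0 * math.pi / wavelength   # Eq. (8.8)
    omega = c * k                    # Eq. 8.9(a)
    phase = k * z - omega * t
    e_x = e0 * math.sin(phase)
    b_y = (e0 / c) * math.sin(phase)  # Eq. (8.10): B0 = E0 / c
    return e_x, b_y


# Illustrative values (assumptions): E0 = 10 V/m, wavelength 1 m, sampled at z = 0.1 m, t = 0.
e_x, b_y = plane_wave(z=0.1, t=0.0, e0=10.0, wavelength=1.0)

print(f"c   = {c:.6e} m/s")
print(f"E_x = {e_x:.4f} V/m, B_y = {b_y:.4e} T, E_x/B_y = {e_x / b_y:.6e} m/s")
```

Because both fields share the same phase, the ratio E_x/B_y equals c at every point where the fields are non-zero.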

We here make remarks on some features of electromagnetic waves. 
They are self-sustaining oscillations of electric and magnetic fields in free 
space, or vacuum. They differ from all the other waves we have studied 
so far, in that no material medium is involved in the vibrations of 
the electric and magnetic fields. Sound waves in air are longitudinal 
waves of compression and rarefaction. Transverse elastic (sound) waves 
can also propagate in a solid, which is rigid and resists shear. 
Scientists in the nineteenth century were so much used to this 
mechanical picture that they thought that there must be some medium 
pervading all space and all matter, which responds to electric and 
magnetic fields just as any elastic medium does. They called this medium 
ether. They were so convinced of the reality of this medium, that there is 
even a novel called The Poison Belt by Sir Arthur Conan Doyle (the 
creator of the famous detective Sherlock Holmes) where the solar system 
is supposed to pass through a poisonous region of ether! We now accept 
that no such physical medium is needed. The famous experiment of 
Michelson and Morley in 1887 demolished conclusively the hypothesis of 
ether. Electric and magnetic fields, oscillating in space and time, can 
sustain each other in vacuum. 



But what if a material medium is actually there? We know that light, an 
electromagnetic wave, does propagate through glass, for example. We 
have seen earlier that the total electric and magnetic fields inside a 
medium are described in terms of a permittivity ε and a magnetic 
permeability μ (these describe the factors by which the total fields differ 
from the external fields). These replace $\varepsilon_0$ and $\mu_0$ in the description of 
electric and magnetic fields in Maxwell’s equations with the result that in 
a material medium of permittivity ε and magnetic permeability μ, the 
velocity of light becomes, 

$v = \dfrac{1}{\sqrt{\mu \varepsilon}}$ (8.11) 

Thus, the velocity of light depends on electric and magnetic properties of 
the medium. We shall see in the next chapter that the refractive index of 
one medium with respect to the other is equal to the ratio of velocities of 
light in the two media. 
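
To see how Eq. (8.11) works in practice, the sketch below computes the wave speed in a medium from its permittivity and permeability and compares it with the vacuum value. The relative permittivity 2.25 and relative permeability 1 are hypothetical values for a non-magnetic medium, assumed here only for illustration, and the function name is likewise an invention of this sketch.

```python
import math

MU0 = 4.0e-7 * math.pi   # permeability of free space, T m/A
EPS0 = 8.854187817e-12   # permittivity of free space, F/m


def wave_speed(rel_permittivity, rel_permeability=1.0):
    """Speed of an electromagnetic wave in a medium, v = 1 / sqrt(mu * eps) (Eq. 8.11)."""
    eps = rel_permittivity * EPS0
    mu = rel_permeability * MU0
    return 1.0 / math.sqrt(mu * eps)


c = wave_speed(1.0)           # vacuum
v = wave_speed(2.25)          # hypothetical medium (assumed values)
speed_ratio = c / v           # ratio of the speed in vacuum to the speed in the medium

print(f"c = {c:.4e} m/s, v = {v:.4e} m/s, c/v = {speed_ratio:.3f}")
```

For these assumed values the wave travels at two-thirds of its vacuum speed, so the ratio c/v is 1.5.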

The velocity of electromagnetic waves in free space or vacuum is an 
important fundamental constant. It has been shown by experiments on 
electromagnetic waves of different wavelengths that this velocity is the 
same (independent of wavelength) to within a few metres per second, out 
of a value of $3 \times 10^{8}$ m/s. The constancy of the velocity of em waves in 
vacuum is so strongly supported by experiments and the actual value is 
so well known now that this is used to define a standard of length. 
Namely, the metre is now defined as the distance travelled by light in 
vacuum in a time (1/c) seconds = $(2.99792458 \times 10^{8})^{-1}$ seconds. This 
has come about for the following reason. The basic unit of time can be 
defined very accurately in terms of some atomic frequency, i.e., 
frequency of light emitted by an atom in a particular process. The basic 
unit of length is harder to define as accurately in a direct way. Earlier 
measurement of c using earlier units of length (metre rods, etc.) 
converged to a value of about $2.9979246 \times 10^{8}$ m/s. Since c is such a 
strongly fixed number, unit of length can be defined in terms of c and the 
unit of time! 

Hertz not only showed the existence of electromagnetic waves, but also 
demonstrated that the waves, which had wavelength ten million times 
that of the light waves, could be diffracted, refracted and polarised. Thus, 
he conclusively established the wave nature of the radiation. Further, he 
produced stationary electromagnetic waves and determined their 
wavelength by measuring the distance between two successive nodes. 
Since the frequency of the wave was known (being equal to the 
frequency of the oscillator), he obtained the speed of the wave using the 
formula v = νλ and found that the waves travelled with the same speed 
as the speed of light. 
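
The standing-wave method described above can be sketched as a short calculation: successive nodes are half a wavelength apart, so the wavelength is twice the node spacing and the speed follows from v = νλ. The oscillator frequency and node spacing below are made-up numbers chosen for illustration; they are not Hertz’s actual measurements.

```python
def wave_speed_from_nodes(frequency_hz, node_spacing_m):
    """Speed of a wave from its frequency and the distance between successive nodes.

    Successive nodes of a standing wave are half a wavelength apart,
    so wavelength = 2 * node spacing and v = frequency * wavelength.
    """
    wavelength = 2.0 * node_spacing_m
    return frequency_hz * wavelength


# Hypothetical readings (assumptions, not historical data).
oscillator_frequency = 5.0e8   # Hz
node_spacing = 0.30            # m

speed = wave_speed_from_nodes(oscillator_frequency, node_spacing)
print(f"speed = {speed:.2e} m/s")
```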

The fact that electromagnetic waves are polarised can be easily seen in 
the response of a portable AM radio to a broadcasting station. If an AM 
radio has a telescopic antenna, it responds to the electric part of the 
signal. When the antenna is turned horizontal, the signal will be greatly 
diminished. Some portable radios have horizontal antenna (usually inside 
the case of radio), which are sensitive to the magnetic component of the 
electromagnetic wave. Such a radio must remain horizontal in order to 
receive the signal. In such cases, response also depends on the 
orientation of the radio with respect to the station. 

Do electromagnetic waves carry energy and momentum like other 
waves? Yes, they do. We have seen in Chapter 2 that in a region of free 
space with electric field E, there is an energy density $(\varepsilon_0 E^2/2)$. Similarly, 
as seen in Chapter 6, associated with a magnetic field B is a magnetic 
energy density $(B^2/2\mu_0)$. As an electromagnetic wave contains both electric 
and magnetic fields, there is a non-zero energy density associated with it. 
Now consider a plane perpendicular to the direction of propagation of the 
electromagnetic wave (Fig. 8.4). If there are, on this plane, electric 
charges, they will be set and sustained in motion by the electric and 
magnetic fields of the electromagnetic wave. The charges thus acquire 
energy and momentum from the waves. This just illustrates the fact that 
an electromagnetic wave (like other waves) carries energy and 
momentum. Since it carries momentum, an electromagnetic wave also 
exerts pressure, called radiation pressure. 

If the total energy transferred to a surface in time t is U, it can be shown 
that the magnitude of the total momentum delivered to this surface (for 
complete absorption) is, 

$p = \dfrac{U}{c}$ (8.12) 

When the sun shines on your hand, you feel the energy being absorbed 
from the electromagnetic waves (your hands get warm). Electromagnetic 
waves also transfer momentum to your hand but because c is very large, 
the amount of momentum transferred is extremely small and you do not 
feel the pressure. In 1903, the American scientists Nichols and Hull 
succeeded in measuring radiation pressure of visible light and verified 
Eq. (8.12). It was found to be of the order of $7 \times 10^{-6}$ N/m$^2$. Thus, on a 
surface of area 10 cm$^2$, the force due to radiation is only about $7 \times 10^{-9}$ N. 
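
The figures quoted above follow from two one-line relations: momentum delivered on complete absorption is U/c, and force is pressure times area. The sketch below is a minimal illustration; the function names are chosen here, c is rounded to 3 × 10^8 m/s, and the 1 J energy is an arbitrary value assumed for the example.

```python
C_LIGHT = 3.0e8  # speed of light, m/s (rounded)


def momentum_on_absorption(energy_joule):
    """Momentum delivered to a completely absorbing surface, p = U / c (Eq. 8.12)."""
    return energy_joule / C_LIGHT


def force_from_pressure(pressure_n_per_m2, area_cm2):
    """Force on a surface from a uniform pressure; the area is converted from cm^2 to m^2."""
    return pressure_n_per_m2 * area_cm2 * 1.0e-4


# Radiation pressure of the order quoted in the text, acting on 10 cm^2.
force = force_from_pressure(7.0e-6, 10.0)
print(f"force = {force:.1e} N")

# Momentum carried by 1 J of absorbed radiation (illustrative value).
print(f"momentum for 1 J = {momentum_on_absorption(1.0):.2e} kg m/s")
```

The large value of c in the denominator is why even a sizeable absorbed energy corresponds to a very small momentum.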

The great technological importance of electromagnetic waves stems from 
their capability to carry energy from one place to another. The radio and 
TV signals from broadcasting stations carry energy. Light carries energy 
from the sun to the earth, thus making life possible on the earth. 


Example 8.2 A plane electromagnetic wave of frequency 
25 MHz travels in free space along the x-direction. At a particular 
point in space and time, $\mathbf{E} = 6.3\,\hat{\mathbf{j}}$ V/m. What is $\mathbf{B}$ at this point? 

Solution Using Eq. (8.10), the magnitude of B is 

$B = \dfrac{E}{c} = \dfrac{6.3\ \text{V/m}}{3 \times 10^{8}\ \text{m/s}} = 2.1 \times 10^{-8}$ T 

To find the direction, we note that E is along y-direction and the wave 
propagates along x-axis. Therefore, B should be in a direction 
perpendicular to both x- and y-axes. Using vector algebra, E × B 
should be along x-direction. Since $(+\hat{\mathbf{j}}) \times (+\hat{\mathbf{k}}) = \hat{\mathbf{i}}$, B is along the 
z-direction. 

Thus, $\mathbf{B} = 2.1 \times 10^{-8}\,\hat{\mathbf{k}}$ T 
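
The direction argument in Example 8.2 can be verified with an explicit cross product. The sketch below is an illustration under one stated assumption: it uses the standard plane-wave relation B = (n × E)/c, where n is the unit vector along the direction of propagation; this relation is not written out in the text but is consistent with E × B pointing along the propagation direction.

```python
C_LIGHT = 3.0e8  # speed of light, m/s (rounded, as in the example)


def cross(a, b):
    """Cross product of two 3-vectors given as (x, y, z) tuples."""
    return (
        a[1] * b[2] - a[2] * b[1],
        a[2] * b[0] - a[0] * b[2],
        a[0] * b[1] - a[1] * b[0],
    )


propagation = (1.0, 0.0, 0.0)   # unit vector along +x
E = (0.0, 6.3, 0.0)             # V/m, along +y

# Assumed plane-wave relation: B = (n x E) / c
B = tuple(component / C_LIGHT for component in cross(propagation, E))

# Check: E x B should point along the direction of propagation (+x).
energy_flow_direction = cross(E, B)

print("B =", B)
print("E x B =", energy_flow_direction)
```

Only the z-component of B is non-zero, with magnitude 2.1 × 10^-8 T, and E × B has a positive x-component and no other components.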


Example 8.3 The magnetic field in a plane electromagnetic wave is 
given by $B_y = (2 \times 10^{-7})\ \text{T}\ \sin(0.5 \times 10^{3} x + 1.5 \times 10^{11} t)$. 

(a) What is the wavelength and frequency of the wave? 

(b) Write an expression for the electric field. 

Solution 

(a) Comparing the given equation with 

$B_y = B_0 \sin\left[2\pi\left(\dfrac{x}{\lambda} + \dfrac{t}{T}\right)\right]$ 

we get, $\lambda = \dfrac{2\pi}{0.5 \times 10^{3}}$ m = 1.26 cm, 

and $\dfrac{1}{T} = \nu = (1.5 \times 10^{11})/2\pi$ = 23.9 GHz 

(b) $E_0 = B_0 c = 2 \times 10^{-7}\ \text{T} \times 3 \times 10^{8}\ \text{m/s} = 6 \times 10^{1}$ V/m 

The electric field component is perpendicular to the direction of 
propagation and the direction of magnetic field. Therefore, the electric 
field component along the z-axis is obtained as 

$E_z = 60 \sin(0.5 \times 10^{3} x + 1.5 \times 10^{11} t)$ V/m 
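
The numbers in Example 8.3 can be checked directly from the coefficients of x and t in the given expression. This is a minimal sketch using the values stated in the example; the variable names are chosen here for illustration.

```python
import math

C_LIGHT = 3.0e8   # speed of light, m/s (rounded, as in the example)

B0 = 2.0e-7       # amplitude of the magnetic field, T
k = 0.5e3         # coefficient of x, rad/m
omega = 1.5e11    # coefficient of t, rad/s

wavelength = 2.0 * math.pi / k      # from k = 2*pi / wavelength
frequency = omega / (2.0 * math.pi)
E0 = B0 * C_LIGHT                   # from B0 = E0 / c

print(f"wavelength = {wavelength * 100:.2f} cm")
print(f"frequency  = {frequency / 1e9:.1f} GHz")
print(f"E0         = {E0:.0f} V/m")
print(f"omega / k  = {omega / k:.1e} m/s")
```

The ratio ω/k equals 3 × 10^8 m/s, which confirms that the given expression describes a wave travelling at the speed of light.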


Example 8.4 Light with an energy flux of 18 W/cm$^2$ falls on a non-reflecting 
surface at normal incidence. If the surface has an area of 
20 cm$^2$, find the average force exerted on the surface during a 30 
minute time span. 

Solution 

The total energy falling on the surface is 

U = (18 W/cm$^2$) × (20 cm$^2$) × (30 × 60 s) 

= $6.48 \times 10^{5}$ J 

Therefore, the total momentum delivered (for complete absorption) is 

$p = \dfrac{U}{c} = \dfrac{6.48 \times 10^{5}\ \text{J}}{3 \times 10^{8}\ \text{m/s}} = 2.16 \times 10^{-3}$ kg m/s 

The average force exerted on the surface is 

$F = \dfrac{p}{t} = \dfrac{2.16 \times 10^{-3}}{0.18 \times 10^{4}} = 1.2 \times 10^{-6}$ N 

How will your result be modified if the surface is a perfect reflector? 
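
The calculation for the absorbing surface in Example 8.4 can be reproduced step by step as follows. This is a small sketch using only the values given in the example, with variable names chosen here; the perfectly reflecting case is deliberately left to the reader.

```python
C_LIGHT = 3.0e8          # speed of light, m/s (rounded, as in the example)

energy_flux = 18.0       # W/cm^2
area = 20.0              # cm^2
duration = 30 * 60       # s (30 minutes)

total_energy = energy_flux * area * duration   # J, energy falling on the surface
momentum = total_energy / C_LIGHT              # kg m/s, complete absorption (Eq. 8.12)
average_force = momentum / duration            # N

print(f"U = {total_energy:.3e} J")
print(f"p = {momentum:.3e} kg m/s")
print(f"F = {average_force:.2e} N")
```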


Example 8.5 Calculate the electric and magnetic fields produced by 
the radiation coming from a 100 W bulb at a distance of 3 m. Assume 
that the efficiency of the bulb is 2.5% and it is a point source. 

Solution The bulb, as a point source, radiates light in all directions 
uniformly. At a distance of 3 m, the surface area of the surrounding 
sphere is 

$A = 4\pi r^2 = 4\pi (3)^2 = 113\ \text{m}^2$ 

The intensity I at this distance is 

$I = \dfrac{\text{Power}}{\text{Area}} = \dfrac{100\ \text{W} \times 2.5\,\%}{113\ \text{m}^2}$ 

= 0.022 W/m$^2$ 

Half of this intensity is provided by the electric field and half by the 
magnetic field. 

$\dfrac{1}{2} I = \dfrac{1}{2}\left(\varepsilon_0 E_{rms}^2\, c\right) = \dfrac{1}{2}\left(0.022\ \text{W/m}^2\right)$ 

$E_{rms} = \sqrt{\dfrac{0.022}{(8.85 \times 10^{-12})(3 \times 10^{8})}}$ V/m 

= 2.9 V/m 

The value of E found above is the root mean square value of the 
electric field. Since the electric field in a light beam is sinusoidal, the 
peak electric field, $E_0$ is 

$E_0 = \sqrt{2}\,E_{rms} = \sqrt{2} \times 2.9$ V/m 

= 4.07 V/m 

Thus, you see that the electric field strength of the light that you use 
for reading is fairly large. Compare it with electric field strength of TV 
or FM waves, which is of the order of a few microvolts per metre. 

Now, let us calculate the strength of the magnetic field. It is 

$B_{rms} = \dfrac{E_{rms}}{c} = \dfrac{2.9\ \text{V m}^{-1}}{3 \times 10^{8}\ \text{m s}^{-1}} = 9.6 \times 10^{-9}$ T 

Again, since the field in the light beam is sinusoidal, the peak 
magnetic field is $B_0 = \sqrt{2}\,B_{rms} = 1.4 \times 10^{-8}$ T. Note that although 
the energy in the magnetic field is equal to the energy in the electric 
field, the magnetic field strength is evidently very weak. 
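
The chain of steps in Example 8.5 (area, intensity, rms fields, peak fields) is easy to follow as a short script. The sketch below uses the values of the example, with ε0 = 8.85 × 10^-12 F/m and c = 3 × 10^8 m/s as in the worked solution; the variable names are chosen here, and small differences in the last digit arise because the script does not round intermediate results.

```python
import math

EPS0 = 8.85e-12   # permittivity of free space, F/m
C_LIGHT = 3.0e8   # speed of light, m/s (rounded)

power = 100.0      # W, electrical power of the bulb
efficiency = 0.025 # fraction of the power radiated as light
distance = 3.0     # m

area = 4.0 * math.pi * distance ** 2     # surface of the surrounding sphere
intensity = power * efficiency / area    # W/m^2

# Intensity I = eps0 * E_rms^2 * c, so E_rms = sqrt(I / (eps0 * c)).
E_rms = math.sqrt(intensity / (EPS0 * C_LIGHT))
E_peak = math.sqrt(2.0) * E_rms

B_rms = E_rms / C_LIGHT
B_peak = math.sqrt(2.0) * B_rms

print(f"A = {area:.0f} m^2, I = {intensity:.3f} W/m^2")
print(f"E_rms = {E_rms:.2f} V/m, E_0 = {E_peak:.2f} V/m")
print(f"B_rms = {B_rms:.2e} T, B_0 = {B_peak:.2e} T")
```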


8.4 Electromagnetic Spectrum 

At the time Maxwell predicted the existence of electromagnetic waves, 
the only familiar electromagnetic waves were the visible light waves. The 
existence of ultraviolet and infrared waves was barely established. By the 
end of the nineteenth century, X-rays and gamma rays had also been 
discovered. We now know that electromagnetic waves include visible 
light waves, X-rays, gamma rays, radio waves, microwaves, ultraviolet 
and infrared waves. The classification of em waves according to 
frequency is the electromagnetic spectrum (Fig. 8.5). There is no sharp 
division between one kind of wave and the next. The classification is 
based roughly on how the waves are produced and/or detected. 

We briefly describe these different types of electromagnetic waves, in 
order of decreasing wavelengths. 

Figure 8.5 The electromagnetic spectrum, with common names for various parts of it. The various 
regions do not have sharply defined boundaries. 

Electromagnetic spectrum 

http://www.fnal.gov/pub/inquiring/more/light 

http://imagine.gsfc.nasa.gov/docs/science/ 

8.4.1 Radio waves 

Radio waves are produced by the accelerated motion of charges in 
conducting wires. They are used in radio and television communication 
systems. They are generally in the frequency range from 500 kHz to 
about 1000 MHz. The AM (amplitude modulated) band is from 530 kHz to 
1710 kHz. Higher frequencies up to 54 MHz are used for short wave 
bands. TV waves range from 54 MHz to 890 MHz. The FM (frequency 
modulated) radio band extends from 88 MHz to 108 MHz. Cellular 
phones use radio waves to transmit voice communication in the ultrahigh 
frequency (UHF) band. How these waves are transmitted and received is 
described in Chapter 15. 
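
The frequency bands quoted above can be converted to wavelengths with the relation νλ = c. The sketch below is illustrative only: the band limits are those stated in this section, c is rounded to 3 × 10^8 m/s, and the dictionary and function names are chosen here.

```python
C_LIGHT = 3.0e8  # speed of light, m/s (rounded)

# Band limits in Hz, as quoted in the text.
BANDS_HZ = {
    "AM radio": (530e3, 1710e3),
    "TV": (54e6, 890e6),
    "FM radio": (88e6, 108e6),
}


def wavelength(frequency_hz):
    """Wavelength in metres of an electromagnetic wave in vacuum, from nu * lambda = c."""
    return C_LIGHT / frequency_hz


for name, (f_low, f_high) in BANDS_HZ.items():
    # The lowest frequency in a band corresponds to its longest wavelength.
    print(f"{name}: {wavelength(f_high):.3g} m to {wavelength(f_low):.3g} m")
```

The AM band corresponds to wavelengths of a few hundred metres, while the FM band lies at about three metres.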

8.4.2 Microwaves 

Microwaves (short-wavelength radio waves), with frequencies in the 
gigahertz (GHz) range, are produced by special vacuum tubes (called 
klystrons and magnetrons) and by solid-state devices such as Gunn diodes. 
Due to their short wavelengths, 
they are suitable for the radar systems used in aircraft navigation. Radar 
also provides the basis for the speed guns used to time fast balls, tennis 
serves, and automobiles. Microwave ovens are an interesting domestic 
application of these waves. In such ovens, the frequency of the 
microwaves is selected to match the resonant frequency of water 
molecules so that energy from the waves is transferred efficiently to the 
kinetic energy of the molecules. This raises the temperature of any food 
containing water. 

8.4.3 Infrared waves 

Infrared waves are produced by hot bodies and molecules. This band lies 
adjacent to the low-frequency or long-wave length end of the visible 
spectrum. Infrared waves are sometimes referred to as heat waves. This 
is because water molecules present in most materials readily absorb 
infrared waves (many other molecules, for example, CO$_2$, NH$_3$, also 
absorb infrared waves). After absorption, their thermal motion increases, 
that is, they heat up and heat their surroundings. Infrared lamps are used 
in physical therapy. Infrared radiation also plays an important role in 
maintaining the earth’s warmth or average temperature through the 
greenhouse effect. Incoming visible light (which passes relatively easily 
through the atmosphere) is absorbed by the earth’s surface and 
re-radiated as infrared (longer wavelength) radiation. This radiation is 
trapped by greenhouse gases such as carbon dioxide and water vapour. 
Infrared detectors are used in Earth satellites, both for military purposes 
and to observe growth of crops. Electronic devices (for example 
semiconductor light emitting diodes) also emit infrared and are widely 
used in the remote switches of household electronic systems such as TV 
sets, video recorders and hi-fi systems. 


Microwave oven 

The spectrum of electromagnetic radiation contains a part known as 
microwaves. These waves have frequency and energy smaller than 
visible light and wavelength larger than it. What is the principle of a 
microwave oven and how does it work? 



Our objective is to cook food or warm it up. All food items such as 
fruit, vegetables, meat, cereals, etc., contain water as a constituent. 
Now, what does it mean when we say that a certain object has 
become warmer? When the temperature of a body rises, the energy 
of the random motion of atoms and molecules increases and the 
molecules travel or vibrate or rotate with higher energies. The 
frequency of rotation of water molecules is about 
2.45 gigahertz (GHz). If water receives microwaves of this frequency, 
its molecules absorb this radiation, which is equivalent to heating up 
water. These molecules share this energy with neighbouring food 
molecules, heating up the food. 

One should use porcelain vessels and not metal containers in a 
microwave oven because of the danger of getting a shock from 
accumulated electric charges. Metals may also melt from heating. 
The porcelain container remains unaffected and cool, because its 
large molecules vibrate and rotate with much smaller frequencies, 
and thus cannot absorb microwaves. Hence, they do not get heated 
up. 

Thus, the basic principle of a microwave oven is to generate 
microwave radiation of appropriate frequency in the working space of 
the oven where we keep food. This way energy is not wasted in 
heating up the vessel. In the conventional heating method, the vessel 
on the burner gets heated first, and then the food inside gets heated 
because of transfer of energy from the vessel. In the microwave 
oven, on the other hand, energy is directly delivered to water 
molecules and is then shared by the entire food. 


8.4.4 Visible rays 

Visible light is the most familiar form of electromagnetic waves. It is the part of the 
spectrum that is detected by the human eye. It runs from about $4 \times 10^{14}$ 
Hz to about $7 \times 10^{14}$ Hz or a wavelength range of about 700 - 400 nm. 
Visible light emitted or reflected from objects around us provides us 
information about the world. Our eyes are sensitive to this range of 
wavelengths. Different animals are sensitive to different ranges of 
wavelengths. For example, snakes can detect infrared waves, and the 
‘visible’ range of many insects extends well into the ultraviolet. 

8.4.5 Ultraviolet rays 

Ultraviolet radiation covers wavelengths ranging from about $4 \times 10^{-7}$ m (400 nm) down to 
$6 \times 10^{-10}$ m (0.6 nm). Ultraviolet (UV) radiation is produced by special lamps 
and very hot bodies. The sun is an important source of ultraviolet light. 
But fortunately, most of it is absorbed in the ozone layer in the 
atmosphere at an altitude of about 40 - 50 km. UV light in large quantities 
has harmful effects on humans. Exposure to UV radiation induces the 
production of more melanin, causing tanning of the skin. UV radiation is 
absorbed by ordinary glass. Hence, one cannot get tanned or sunburnt 
through glass windows. 

Welders wear special glass goggles or face masks with glass windows to 
protect their eyes from the large amount of UV produced by welding arcs. 
Due to its shorter wavelengths, UV radiation can be focussed into very 
narrow beams for high precision applications such as LASIK (Laser-assisted 
in situ keratomileusis) eye surgery. UV lamps are used to kill 
germs in water purifiers. 

Ozone layer in the atmosphere plays a protective role, and hence its 
depletion by chlorofluorocarbons (CFCs) gas (such as freon) is a matter 
of international concern. 

8.4.6 X-rays 

Beyond the UV region of the electromagnetic spectrum lies the X-ray
region. We are familiar with X-rays because of their medical applications. They
cover wavelengths from about $10^{-8}$ m (10 nm) down to $10^{-13}$ m ($10^{-4}$
nm). One common way to generate X-rays is to bombard a metal target
with high energy electrons. X-rays are used as a diagnostic tool in
medicine and as a treatment for certain forms of cancer. Because X-rays
damage or destroy living tissues and organisms, care must be taken to
avoid unnecessary exposure or overexposure.

8.4.7 Gamma rays 

They lie in the upper frequency range of the electromagnetic spectrum
and have wavelengths from about $10^{-10}$ m to less than $10^{-14}$ m. This
high frequency radiation is produced in nuclear reactions and also
emitted by radioactive nuclei. They are used in medicine to destroy
cancer cells.

Table 8.1 summarises different types of electromagnetic waves, their 
production and detections. As mentioned earlier, the demarcation 
between different regions is not sharp and there are overlaps. 



Table 8.1 Different types of electromagnetic waves

| Type | Wavelength range | Production | Detection |
|---|---|---|---|
| Radio | > 0.1 m | Rapid acceleration and decelerations of electrons in aerials | Receiver's aerials |
| Microwave | 0.1 m to 1 mm | Klystron valve or magnetron valve | Point contact diodes |
| Infra-red | 1 mm to 700 nm | Vibration of atoms and molecules | Thermopiles, bolometer, infrared photographic film |
| Light | 700 nm to 400 nm | Electrons in atoms emit light when they move from one energy level to a lower energy level | The eye, photocells, photographic film |
| Ultraviolet | 400 nm to 1 nm | Inner shell electrons in atoms moving from one energy level to a lower level | Photocells, photographic film |
| X-rays | 1 nm to $10^{-3}$ nm | X-ray tubes or inner shell electrons | Photographic film, Geiger tubes, ionisation chamber |
| Gamma rays | < $10^{-3}$ nm | Radioactive decay of the nucleus | Same as for X-rays |

Summary 

1. Maxwell found an inconsistency in Ampere’s law and
suggested the existence of an additional current, called displacement
current, to remove this inconsistency. This displacement current is
due to time-varying electric field and is given by

$i_d = \varepsilon_0 \dfrac{d\Phi_E}{dt}$

and acts as a source of magnetic field in exactly the same way as
conduction current.

2. An accelerating charge produces electromagnetic waves. An
electric charge oscillating harmonically with frequency $\nu$ produces
electromagnetic waves of the same frequency $\nu$. An electric dipole is
a basic source of electromagnetic waves.

3. Electromagnetic waves with wavelength of the order of a few 
metres were first produced and detected in the laboratory by Hertz in 
1887. He thus verified a basic prediction of Maxwell’s equations. 


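4. Electric and magnetic fields oscillate sinusoidally in space and time
in an electromagnetic wave. The oscillating electric and magnetic
fields, E and B, are perpendicular to each other, and to the direction of
propagation of the electromagnetic wave. For a wave of frequency $\nu$,
wavelength $\lambda$, propagating along z-direction, we have

$E = E_x(t) = E_0 \sin(kz - \omega t) = E_0 \sin\left[2\pi\left(\dfrac{z}{\lambda} - \nu t\right)\right] = E_0 \sin\left[2\pi\left(\dfrac{z}{\lambda} - \dfrac{t}{T}\right)\right]$

$B = B_y(t) = B_0 \sin(kz - \omega t) = B_0 \sin\left[2\pi\left(\dfrac{z}{\lambda} - \nu t\right)\right] = B_0 \sin\left[2\pi\left(\dfrac{z}{\lambda} - \dfrac{t}{T}\right)\right]$

They are related by $E_0/B_0 = c$.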

5. The speed c of electromagnetic wave in vacuum is related to $\mu_0$
and $\varepsilon_0$ (the free space permeability and permittivity constants) as
follows: $c = 1/\sqrt{\mu_0 \varepsilon_0}$. The value of c equals the speed of light
obtained from optical measurements.

Light is an electromagnetic wave; c is, therefore, also the speed of
light. Electromagnetic waves other than light also have the same
velocity c in free space.

The speed of light, or of electromagnetic waves in a material medium
is given by $v = 1/\sqrt{\mu \varepsilon}$

where $\mu$ is the permeability of the medium and $\varepsilon$ its permittivity.

6. Electromagnetic waves carry energy as they travel through space 
and this energy is shared equally by the electric and magnetic fields. 

Electromagnetic waves transport momentum as well. When these 
waves strike a surface, a pressure is exerted on the surface. If total 
energy transferred to a surface in time t is U, total momentum 
delivered to this surface is p = U/c. 

7. The spectrum of electromagnetic waves stretches, in principle,
over an infinite range of wavelengths. Different regions are known by
different names: $\gamma$-rays, X-rays, ultraviolet rays, visible rays, infrared
rays, microwaves and radio waves, in order of increasing wavelength
from $10^{-2}$ Å or $10^{-12}$ m to $10^{6}$ m.

They interact with matter via their electric and magnetic fields which 
set in oscillation charges present in all matter. The detailed 
interaction and so the mechanism of absorption, scattering, etc., 
depend on the wavelength of the electromagnetic wave, and the 
nature of the atoms and molecules in the medium. 
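
The relations in points 4 to 6 of the summary can be checked numerically. The sketch below is only illustrative: the constant values are standard rounded figures, and the sample field amplitude (300 N/C) and energy (3 J) are assumptions chosen to give round numbers, not values from the text.

```python
import math

# Rounded free-space constants (assumed values for this sketch).
MU_0 = 4 * math.pi * 1e-7   # permeability of free space, T m / A
EPSILON_0 = 8.854e-12       # permittivity of free space, C^2 / (N m^2)


def speed_in_vacuum():
    """c = 1 / sqrt(mu_0 * epsilon_0)."""
    return 1.0 / math.sqrt(MU_0 * EPSILON_0)


def magnetic_amplitude(e0, c):
    """B_0 = E_0 / c, from the relation E_0 / B_0 = c."""
    return e0 / c


def momentum_delivered(energy, c):
    """p = U / c for total energy U transferred to a surface."""
    return energy / c


c = speed_in_vacuum()
print(f"c from mu_0 and epsilon_0: {c:.4e} m/s")
print(f"B_0 for E_0 = 300 N/C: {magnetic_amplitude(300.0, 3.0e8):.1e} T")
print(f"momentum for U = 3 J: {momentum_delivered(3.0, 3.0e8):.1e} kg m/s")
```

The tiny momentum obtained for even a few joules of absorbed energy shows why radiation pressure is hard to notice in everyday situations.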


Points to Ponder 

1. The basic difference between various types of electromagnetic 
waves lies in their wavelengths or frequencies since all of them travel 
through vacuum with the same speed. Consequently, the waves differ 
considerably in their mode of interaction with matter. 

2. Accelerated charged particles radiate electromagnetic waves. The
wavelength of the electromagnetic wave is often correlated with the
characteristic size of the system that radiates. Thus, gamma
radiation, having wavelength of $10^{-14}$ m to $10^{-15}$ m, typically
originates from an atomic nucleus. X-rays are emitted from heavy
atoms. Radio waves are produced by accelerating electrons in a
circuit. A transmitting antenna can most efficiently radiate waves
having a wavelength of about the same size as the antenna. Visible
radiation emitted by atoms is, however, much longer in wavelength
than atomic size.

3. The oscillating fields of an electromagnetic wave can accelerate
charges and can produce oscillating currents. An apparatus designed
to detect electromagnetic waves is therefore based on this
fact. Hertz’s original ‘receiver’ worked in exactly this way. The same
basic principle is utilised in practically all modern receiving devices.
High frequency electromagnetic waves are detected by other means 
based on the physical effects they produce on interacting with matter. 

4. Infrared waves, with frequencies lower than those of visible light, 
vibrate not only the electrons, but entire atoms or molecules of a 
substance. This vibration increases the internal energy and 
consequently, the temperature of the substance. This is why infrared 
waves are often called heat waves. 

5. The centre of sensitivity of our eyes coincides with the centre of 
the wavelength distribution of the sun. It is because humans have 
evolved with visions most sensitive to the strongest wavelengths from 
the sun. 




Exercises 


8.1 Figure 8.6 shows a capacitor made of two circular plates each of 
radius 12 cm, and separated by 5.0 cm. The capacitor is being 
charged by an external source (not shown in the figure). The 
charging current is constant and equal to 0.15A. 

(a) Calculate the capacitance and the rate of change of potential 
difference between the plates. 

(b) Obtain the displacement current across the plates. 

(c) Is Kirchhoff’s first rule (junction rule) valid at each plate of the 
capacitor? Explain. 



Figure 8.6 


8.2 A parallel plate capacitor (Fig. 8.7) made of circular plates each of
radius R = 6.0 cm has a capacitance C = 100 pF. The capacitor is
connected to a 230 V ac supply with a (angular) frequency of 300 rad s$^{-1}$.

(a) What is the rms value of the conduction current? 

(b) Is the conduction current equal to the displacement current? 

(c) Determine the amplitude of B at a point 3.0 cm from the axis 
between the plates. 











Figure 8.7 

8.3 What physical quantity is the same for X-rays of wavelength
$10^{-10}$ m, red light of wavelength 6800 Å and radiowaves of
wavelength 500 m?

8.4 A plane electromagnetic wave travels in vacuum along z- 
direction. What can you say about the directions of its electric and 
magnetic field vectors? If the frequency of the wave is 30 MHz, what 
is its wavelength? 

8.5 A radio can tune in to any station in the 7.5 MHz to 12 MHz band. 
What is the corresponding wavelength band? 

8.6 A charged particle oscillates about its mean equilibrium position
with a frequency of $10^{9}$ Hz. What is the frequency of the
electromagnetic waves produced by the oscillator?

8.7 The amplitude of the magnetic field part of a harmonic 
electromagnetic wave in vacuum is B0 = 510 nT. What is the 
amplitude of the electric field part of the wave? 

8.8 Suppose that the electric field amplitude of an electromagnetic
wave is $E_0$ = 120 N/C and that its frequency is $\nu$ = 50.0 MHz. (a)
Determine $B_0$, $\omega$, $k$, and $\lambda$. (b) Find expressions for E and B.

8.9 The terminology of different parts of the electromagnetic spectrum
is given in the text. Use the formula $E = h\nu$ (for energy of a quantum
of radiation: photon) and obtain the photon energy in units of eV for
different parts of the electromagnetic spectrum. In what way are the
different scales of photon energies that you obtain related to the
sources of electromagnetic radiation?

8.10 In a plane electromagnetic wave, the electric field oscillates
sinusoidally at a frequency of $2.0 \times 10^{10}$ Hz and amplitude 48 V m$^{-1}$.

(a) What is the wavelength of the wave?

(b) What is the amplitude of the oscillating magnetic field?

(c) Show that the average energy density of the E field equals the
average energy density of the B field. [$c = 3 \times 10^{8}$ m s$^{-1}$.]


Additional Exercises 

8.11 Suppose that the electric field part of an electromagnetic wave in
vacuum is $\mathbf{E} = \{(3.1\ \text{N/C}) \cos[(1.8\ \text{rad/m})\,y + (5.4 \times 10^{6}\ \text{rad/s})\,t]\}\,\hat{\mathbf{i}}$.

(a) What is the direction of propagation?

(b) What is the wavelength $\lambda$?

(c) What is the frequency $\nu$?

(d) What is the amplitude of the magnetic field part of the wave? 

(e) Write an expression for the magnetic field part of the wave. 

8.12 About 5% of the power of a 100 W light bulb is converted to 
visible radiation. What is the average intensity of visible radiation 

(a) at a distance of 1m from the bulb? 

(b) at a distance of 10 m? 

Assume that the radiation is emitted isotropically and neglect
reflection.

8.13 Use the formula $\lambda_m T = 0.29$ cm K to obtain the characteristic
temperature ranges for different parts of the electromagnetic 
spectrum. What do the numbers that you obtain tell you? 

8.14 Given below are some famous numbers associated with 
electromagnetic radiations in different contexts in physics. State the 
part of the electromagnetic spectrum to which each belongs. 

(a) 21 cm (wavelength emitted by atomic hydrogen in interstellar 
space). 

(b) 1057 MHz (frequency of radiation arising from two close energy 
levels in hydrogen; known as Lamb shift). 

(c) 2.7 K [temperature associated with the isotropic radiation filling all
space, thought to be a relic of the ‘big-bang’ origin of the universe].

(d) 5890 Å - 5896 Å [double lines of sodium]

(e) 14.4 keV [energy of a particular transition in $^{57}$Fe nucleus
associated with a famous high resolution spectroscopic method
(Mössbauer spectroscopy)].

8.15 Answer the following questions: 

(a) Long distance radio broadcasts use short-wave bands. Why? 

(b) It is necessary to use satellites for long distance TV transmission. 
Why? 

(c) Optical and radiotelescopes are built on the ground but X-ray 
astronomy is possible only from satellites orbiting the earth. Why? 

(d) The small ozone layer on top of the stratosphere is crucial for 
human survival. Why? 




(e) If the earth did not have an atmosphere, would its average 
surface temperature be higher or lower than what it is now? 

(f) Some scientists have predicted that a global nuclear war on the 
earth would be followed by a severe ‘nuclear winter’ with a 
devastating effect on life on earth. What might be the basis of this 
prediction? 
Chapter Nine


Ray Optics and Optical 
Instruments 


9.1 Introduction 

Nature has endowed the human eye (retina) with the sensitivity to detect 
electromagnetic waves within a small range of the electromagnetic 
spectrum. Electromagnetic radiation belonging to this region of the 
spectrum (wavelength of about 400 nm to 750 nm) is called light. It is 
mainly through light and the sense of vision that we know and interpret 
the world around us. 

There are two things that we can intuitively mention about light from 
common experience. First, that it travels with enormous speed and 
second, that it travels in a straight line. It took some time for people to 
realise that the speed of light is finite and measurable. Its presently
accepted value in vacuum is $c = 2.99792458 \times 10^{8}$ m s$^{-1}$. For many
purposes, it suffices to take $c = 3 \times 10^{8}$ m s$^{-1}$. The speed of light in
vacuum is the highest speed attainable in nature.

The intuitive notion that light travels in a straight line seems to contradict 
what we have learnt in Chapter 8, that light is an electromagnetic wave of 
wavelength belonging to the visible part of the spectrum. How to 
reconcile the two facts? The answer is that the wavelength of light is very 
small compared to the size of ordinary objects that we encounter 
commonly (generally of the order of a few cm or larger). In this situation, 
as you will learn in Chapter 10, a light wave can be considered to travel
from one point to another, along a straight line joining them. The path is
called a ray of light, and a bundle of such rays constitutes a beam of light.

In this chapter, we consider the phenomena of reflection, refraction and 
dispersion of light, using the ray picture of light. Using the basic laws of 
reflection and refraction, we shall study the image formation by plane and 
spherical reflecting and refracting surfaces. We then go on to describe 
the construction and working of some important optical instruments, 
including the human eye. 


Particle model of light 

Newton’s fundamental contributions to mathematics, mechanics, and gravitation often blind 
us to his deep experimental and theoretical study of light. He made pioneering contributions 
in the field of optics. He further developed the corpuscular model of light proposed by 
Descartes. It presumes that light energy is concentrated in tiny particles called corpuscles. 
He further assumed that corpuscles of light were massless elastic particles. With his 
understanding of mechanics, he could come up with a simple model of reflection and 
refraction. It is a common observation that a ball bouncing from a smooth plane surface 
obeys the laws of reflection. When this is an elastic collision, the magnitude of the velocity 
remains the same. As the surface is smooth, there is no force acting parallel to the surface, 
so the component of momentum in this direction also remains the same. Only the component 
perpendicular to the surface, i.e., the normal component of the momentum, gets reversed in 
reflection. Newton argued that smooth surfaces like mirrors reflect the corpuscles in a similar 
manner. 

In order to explain the phenomena of refraction, Newton postulated that the speed of the 
corpuscles was greater in water or glass than in air. However, later on it was discovered that 
the speed of light is less in water or glass than in air. 

In the field of optics, Newton - the experimenter, was greater than 
Newton - the theorist. He himself observed many phenomena, which 


were difficult to understand in terms of particle nature of light. For 
example, the colours observed due to a thin film of oil on water. 
Property of partial reflection of light is yet another such example. 
Everyone who has looked into the water in a pond sees image of the 
face in it, but also sees the bottom of the pond. Newton argued that 
some of the corpuscles, which fall on the water, get reflected and 
some get transmitted. But what property could distinguish these two 
kinds of corpuscles? Newton had to postulate some kind of 
unpredictable, chance phenomenon, which decided whether an 
individual corpuscle would be reflected or not. In explaining other 
phenomena, however, the corpuscles were presumed to behave as if 
they are identical. Such a dilemma does not occur in the wave picture 
of light. An incoming wave can be divided into two weaker waves at 
the boundary between air and water. 


9.2 Reflection of Light by Spherical Mirrors 

We are familiar with the laws of reflection. The angle of reflection (i.e., the 
angle between reflected ray and the normal to the reflecting surface or 
the mirror) equals the angle of incidence (angle between incident ray and 
the normal). Also that the incident ray, reflected ray and the normal to the 
reflecting surface at the point of incidence lie in the same plane 
(Fig. 9.1). These laws are valid at each point on any reflecting surface 
whether plane or curved. However, we shall restrict our discussion to the 
special case of curved surfaces, that is, spherical surfaces. The normal in 
this case is to be taken as normal to the tangent to surface at the point of 
incidence. That is, the normal is along the radius, the line joining the 
centre of curvature of the mirror to the point of incidence. 


Figure 9.1 The incident ray, reflected ray and the normal to the reflecting surface lie in the same plane.

We have already studied that the geometric centre of a spherical mirror is 
called its pole while that of a spherical lens is called its optical centre. 
The line joining the pole and the centre of curvature of the spherical 
mirror is known as the principal axis. In the case of spherical lenses, the 
principal axis is the line joining the optical centre with its principal focus 
as you will see later. 

9.2.1 Sign convention 

To derive the relevant formulae for reflection by spherical mirrors and 
refraction by spherical lenses, we must first adopt a sign convention for 
measuring distances. In this book, we shall follow the Cartesian sign 
convention. According to this convention, all distances are measured 
from the pole of the mirror or the optical centre of the lens. The distances 
measured in the same direction as the incident light are taken as positive 
and those measured in the direction opposite to the direction of incident 
light are taken as negative (Fig. 9.2). The heights measured upwards 
with respect to x-axis and normal to the principal axis (x-axis) of the 
mirror/lens are taken as positive (Fig. 9.2). The heights measured
downwards are taken as negative.


Figure 9.2 The Cartesian Sign Convention.


With a commonly accepted convention, it turns out that a single formula for
spherical mirrors and a single formula for spherical lenses can handle all 
different cases. 


9.2.2 Focal length of spherical mirrors 

Figure 9.3 shows what happens when a parallel beam of light is incident 
on (a) a concave mirror, and (b) a convex mirror. We assume that the 
rays are paraxial, i.e., they are incident at points close to the pole P of the 
mirror and make small angles with the principal axis. The reflected rays 
converge at a point F on the principal axis of a concave mirror [Fig.
9.3(a)].

For a convex mirror, the reflected rays appear to diverge from a point F 
on its principal axis [Fig. 9.3(b)]. The point F is called the principal focus 
of the mirror. If the parallel paraxial beam of light were incident, making 
some angle with the principal axis, the reflected rays would converge (or 









appear to diverge) from a point in a plane through F normal to the 
principal axis. This is called the focal plane of the mirror [Fig. 9.3(c)]. 
Figure 9.3 Focus of a concave and convex mirror.
















The distance between the focus F and the pole P of the mirror is called
the focal length of the mirror, denoted by f. We now show that f = R/2,
where R is the radius of curvature of the mirror. The geometry of
reflection of an incident ray is shown in Fig. 9.4.

Let C be the centre of curvature of the mirror. Consider a ray parallel to 
the principal axis striking the mirror at M. Then CM will be perpendicular 
to the mirror at M. Let $\theta$ be the angle of incidence, and MD be the
perpendicular from M on the principal axis. Then,

$\angle MCP = \theta$ and $\angle MFP = 2\theta$




Figure 9.4 Geometry of reflection of an incident ray on (a) concave spherical mirror, and (b) convex spherical mirror.


Now,

$\tan\theta = \dfrac{MD}{CD}$ and $\tan 2\theta = \dfrac{MD}{FD}$ (9.1)

For small $\theta$, which is true for paraxial rays, $\tan\theta \approx \theta$, $\tan 2\theta \approx 2\theta$.
Therefore, Eq. (9.1) gives

$\dfrac{MD}{FD} = 2\,\dfrac{MD}{CD}$

or, $FD = \dfrac{CD}{2}$ (9.2)

Now, for small $\theta$, the point D is very close to the point P. Therefore, FD = f
and CD = R. Equation (9.2) then gives

$f = R/2$ (9.3)
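
To see how good the paraxial approximation is, one can compare f = R/2 with the exact crossing point of a reflected ray. The sketch below assumes a concave mirror and works with magnitudes only. It uses a step not spelled out in the text: since $\angle MCP = \theta$ and $\angle MFP = 2\theta$, the triangle CFM has equal angles $\theta$ at C and M, so $CF = FM = R/(2\cos\theta)$. The function name is illustrative.

```python
import math


def axis_crossing_from_pole(h, R):
    """Distance PF from the pole to the point where the reflected ray crosses
    the principal axis, for an incident ray parallel to the axis at height h
    on a concave mirror of radius of curvature R (magnitudes only)."""
    theta = math.asin(h / R)         # angle of incidence: sin(theta) = MD / CM
    cf = R / (2 * math.cos(theta))   # triangle CFM is isosceles, so CF = FM
    return R - cf                    # PF = PC - CF


R = 1.0  # radius of curvature in metres (illustrative value)
for h in (0.01, 0.1, 0.5):
    print(f"h = {h:4.2f} m: ray crosses axis {axis_crossing_from_pole(h, R):.5f} m from P")
```

For rays close to the axis the crossing point is very nearly R/2 = 0.5 m; for rays far from the axis it moves noticeably towards the pole, which is why the derivation is restricted to paraxial rays.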

9.2.3 The mirror equation 

If rays emanating from a point actually meet at another point after 
reflection and/or refraction, that point is called the image of the first point. 
The image is real if the rays actually converge to the point; it is virtual if 
the rays do not actually meet but appear to diverge from the point when 
produced backwards. An image is thus a point-to-point correspondence 
with the object established through reflection and/or refraction. 

In principle, we can take any two rays emanating from a point on an 
object, trace their paths, find their point of intersection and thus, obtain 
the image of the point due to reflection at a spherical mirror. In practice, 
however, it is convenient to choose any two of the following rays: 

(i) The ray from the point which is parallel to the principal axis. The 
reflected ray goes through the focus of the mirror. 








Figure 9.5 Ray diagram for image 

(ii) The ray passing through the centre of curvature of a concave mirror or 
appearing to pass through it for a convex mirror. The reflected ray simply 
retraces the path. 

(iii) The ray passing through (or directed towards) the focus of the 
concave mirror or appearing to pass through (or directed towards) the 
focus of a convex mirror. The reflected ray is parallel to the principal axis. 

(iv) The ray incident at any angle at the pole. The reflected ray follows 
laws of reflection. 

Figure 9.5 shows the ray diagram considering three rays. It shows the 
image A'B' (in this case, real) of an object AB formed by a concave 
mirror. It does not mean that only three rays emanate from the point A. 
An infinite number of rays emanate from any source, in all directions. 
Thus, point A' is image point of A if every ray originating at point A and 
falling on the concave mirror after reflection passes through the point A'. 

We now derive the mirror equation or the relation between the object 
distance ( u ), image distance (v) and the focal length ( f ). 










From Fig. 9.5, the two right-angled triangles A'B'F and MPF are similar.
(For paraxial rays, MP can be considered to be a straight line
perpendicular to CP.) Therefore,

$\dfrac{B'A'}{PM} = \dfrac{B'F}{FP}$

or $\dfrac{B'A'}{BA} = \dfrac{B'F}{FP}$ ($\because$ PM = AB) (9.4)

Since $\angle APB = \angle A'PB'$, the right-angled triangles A'B'P and ABP are
also similar. Therefore,

$\dfrac{B'A'}{BA} = \dfrac{B'P}{BP}$ (9.5)


Comparing Eqs. (9.4) and (9.5), we get

$\dfrac{B'F}{FP} = \dfrac{B'P - FP}{FP} = \dfrac{B'P}{BP}$ (9.6)


Equation (9.6) is a relation involving magnitude of distances. We now
apply the sign convention. We note that light travels from the object to the
mirror MPN. Hence this is taken as the positive direction. To reach the
object AB, image A'B' as well as the focus F from the pole P, we have to
travel opposite to the direction of incident light. Hence, all the three will
have negative signs. Thus,

$B'P = -v$, $FP = -f$, $BP = -u$

Using these in Eq. (9.6), we get

$\dfrac{-v + f}{-f} = \dfrac{-v}{-u}$

or $\dfrac{v - f}{f} = \dfrac{v}{u}$

$\dfrac{v}{f} = 1 + \dfrac{v}{u}$

Dividing it by v, we get

$\dfrac{1}{v} + \dfrac{1}{u} = \dfrac{1}{f}$ (9.7)


This relation is known as the mirror equation. 

The size of the image relative to the size of the object is another
important quantity to consider. We define linear magnification (m) as the
ratio of the height of the image (h') to the height of the object (h):

$m = \dfrac{h'}{h}$ (9.8)


h and h' will be taken positive or negative in accordance with the
accepted sign convention. In triangles A'B'P and ABP, we have,

$\dfrac{B'A'}{BA} = \dfrac{B'P}{BP}$

With the sign convention, this becomes

$\dfrac{-h'}{h} = \dfrac{-v}{-u}$

so that

$m = \dfrac{h'}{h} = -\dfrac{v}{u}$ (9.9)

We have derived here the mirror equation, Eq. (9.7), and the 
magnification formula, Eq. (9.9), for the case of real, inverted image 
formed by a concave mirror. With the proper use of sign convention, 
these are, in fact, valid for all the cases of reflection by a spherical mirror
(concave or convex) whether the image formed is real or virtual. Figure 
9.6 shows the ray diagrams for virtual image formed by a concave and 
convex mirror. You should verify that Eqs. (9.7) and (9.9) are valid for 
these cases as well. 
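
As a quick numerical check of Eqs. (9.7) and (9.9) under the Cartesian sign convention, the sketch below solves the mirror equation for v and evaluates the magnification. The focal lengths and object distances are illustrative assumptions (not taken from the text), and the function names are hypothetical.

```python
def image_distance(u, f):
    """Solve the mirror equation 1/v + 1/u = 1/f for v.
    u and f carry signs as per the Cartesian sign convention.
    (If u equals f the image is at infinity and this raises ZeroDivisionError.)"""
    return 1.0 / (1.0 / f - 1.0 / u)


def magnification(u, v):
    """Linear magnification m = h'/h = -v/u, Eq. (9.9)."""
    return -v / u


# Illustrative cases: (label, object distance u in cm, focal length f in cm).
cases = [
    ("concave, object beyond C", -30.0, -10.0),
    ("concave, object between P and F", -5.0, -10.0),
    ("convex", -30.0, 10.0),
]
for label, u, f in cases:
    v = image_distance(u, f)
    m = magnification(u, v)
    kind = "real" if v < 0 else "virtual"
    print(f"{label}: v = {v:.1f} cm, m = {m:.2f} ({kind})")
```

A negative v places the image in front of the mirror (real), while a positive v places it behind the mirror (virtual); the sign of m tells whether the image is inverted (negative) or erect (positive).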



Figure 9.6 Image formation by (a) a concave mirror with object between P and F, and (b) a convex mirror.


Example 9.1 Suppose that the lower half of the concave mirror’s reflecting surface in Fig. 9.5 
is covered with an opaque (non-reflective) material. What effect will this have on the image of 
an object placed in front of the mirror? 

Solution You may think that the image will now show only half of the object, but taking the 
laws of reflection to be true for all points of the remaining part of the mirror, the image will be 
that of the whole object. However, as the area of the reflecting surface has been reduced, the 
intensity of the image will be low (in this case, half). 

Example 9.2 A mobile phone lies along the principal axis of a concave mirror, as shown in 
Fig. 9.7. Show by suitable diagram, the formation of its image. Explain why the magnification 
is not uniform. Will the distortion of image depend on the location of the phone with respect to 
the mirror? 






















The image is 30 cm from the mirror on the same side as the object. 


Also, magnification $m = -\dfrac{v}{u} = -\dfrac{(-30)}{(-10)} = -3$


The image is magnified, real and inverted. 


(ii) The object distance u = -5 cm. Then from Eq. (9.7),

$\dfrac{1}{v} + \dfrac{1}{-5} = \dfrac{1}{-7.5}$

or $v = \dfrac{5 \times 7.5}{(7.5 - 5)} = 15$ cm

This image is formed at 15 cm behind the mirror. It is a virtual image.

Magnification $m = -\dfrac{v}{u} = -\dfrac{15}{(-5)} = 3$


The image is magnified, virtual and erect. 


Example 9.4 Suppose while sitting in a parked car, you notice a jogger approaching towards
you in the side view mirror of R = 2 m. If the jogger is running at a speed of 5 m s$^{-1}$, how fast
does the image of the jogger appear to move when the jogger is (a) 39 m, (b) 29 m, (c) 19 m, and
(d) 9 m away?


Solution 


From the mirror equation, Eq. (9.7), we get

$v = \dfrac{fu}{u - f}$

For convex mirror, since R = 2 m, f = 1 m. Then

for u = -39 m, $v = \dfrac{(-39) \times 1}{-39 - 1} = \dfrac{39}{40}$ m

Since the jogger moves at a constant speed of 5 m s$^{-1}$, after 1 s the position of the
image v (for u = -39 + 5 = -34) is (34/35) m.

The shift in the position of image in 1 s is

$\dfrac{39}{40} - \dfrac{34}{35} = \dfrac{1365 - 1360}{1400} = \dfrac{5}{1400} = \dfrac{1}{280}$ m

Therefore, the average speed of the image when the jogger is between
39 m and 34 m from the mirror, is (1/280) m s$^{-1}$.


Similarly, it can be seen that for u = -29 m, -19 m and -9 m, the speed with which the image
appears to move is

$\dfrac{1}{150}$ m s$^{-1}$, $\dfrac{1}{60}$ m s$^{-1}$ and $\dfrac{1}{10}$ m s$^{-1}$, respectively.


Although the jogger has been moving with a constant speed, the speed of his/her image 
appears to increase substantially as he/she moves closer to the mirror. This phenomenon 
can be noticed by any person sitting in a stationary car or a bus. In case of moving vehicles, 
a similar phenomenon could be observed if the vehicle in the rear is moving closer with a 
constant speed. 
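
The arithmetic of Example 9.4 can be reproduced with exact fractions. The sketch below assumes, as in the solution, a convex mirror with f = 1 m, a jogger speed of 5 m/s and a 1 s interval; the function names are illustrative.

```python
from fractions import Fraction


def image_position(u, f):
    """Image distance from the mirror equation rearranged as v = f u / (u - f)."""
    return f * u / (u - f)


def average_image_speed(u_start, jogger_speed, f, dt=1):
    """Average speed of the image over an interval dt, during which the
    object distance changes from u_start to u_start + jogger_speed * dt."""
    u_end = u_start + jogger_speed * dt
    shift = image_position(u_end, f) - image_position(u_start, f)
    return abs(shift) / dt


f = Fraction(1)        # focal length of the convex mirror in metres (R = 2 m)
speed = Fraction(5)    # jogger speed in m/s
for distance in (39, 29, 19, 9):
    v_image = average_image_speed(Fraction(-distance), speed, f)
    print(f"jogger {distance} m away: image speed = {v_image} m/s")
```

Using `Fraction` keeps the results in the same form as the worked solution (1/280, 1/150, 1/60 and 1/10 m/s) instead of rounded decimals.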


9.3 Refraction 

When a beam of light encounters another transparent medium, a part of
light gets reflected back into the first medium while the rest enters the
other. A ray of light represents a beam. The direction of propagation of an
obliquely incident (0° < i < 90°) ray of light that enters the other medium,
changes at the interface of the two media. This phenomenon is called
refraction of light. Snell experimentally obtained the following laws of
refraction:


(i) The incident ray, the refracted ray and the normal to the interface at 
the point of incidence, all lie in the same plane. 

(ii) The ratio of the sine of the angle of incidence to the sine of angle of
refraction is constant. Remember that the angles of incidence (i) and
refraction (r) are the angles that the incident and its refracted ray make
with the normal, respectively. We have

$\dfrac{\sin i}{\sin r} = n_{21}$ (9.10)

where $n_{21}$ is a constant, called the refractive index of the second medium
with respect to the first medium. Equation (9.10) is the well-known Snell’s
law of refraction. We note that $n_{21}$ is a characteristic of the pair of media
(and also depends on the wavelength of light), but is independent of the
angle of incidence.

Figure 9.8 Refraction and reflection of light.


From Eq. (9.10), if $n_{21} > 1$, $r < i$, i.e., the refracted ray bends towards the
normal. In such a case medium 2 is said to be optically denser (or
denser, in short) than medium 1. On the other hand, if $n_{21} < 1$, $r > i$, the
refracted ray bends away from the normal. This is the case when incident
ray in a denser medium refracts into a rarer medium.

Note: Optical density should not be confused with mass density, which is 
mass per unit volume. It is possible that mass density of an optically 
denser medium may be less than that of an optically rarer medium 
(optical density is the ratio of the speed of light in two media). For 
example, turpentine and water. Mass density of turpentine is less than 
that of water but its optical density is higher. 

If $n_{21}$ is the refractive index of medium 2 with respect to medium 1 and
$n_{12}$ the refractive index of medium 1 with respect to medium 2, then it
should be clear that

$n_{12} = \dfrac{1}{n_{21}}$ (9.11)

It also follows that if $n_{32}$ is the refractive index of medium 3 with respect
to medium 2 then $n_{32} = n_{31} \times n_{12}$, where $n_{31}$ is the refractive index of
medium 3 with respect to medium 1.
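
Snell's law, Eq. (9.10), and the relations between relative refractive indices can be tried out numerically. In the sketch below the index values (1.33 for water and 1.5 for glass, both with respect to air) are typical figures assumed for illustration, and the function name is hypothetical.

```python
import math


def refraction_angle_deg(i_deg, n21):
    """Angle of refraction r (in degrees) from sin i / sin r = n21.
    Returns None when sin r would exceed 1, i.e. Eq. (9.10) has no solution."""
    sin_r = math.sin(math.radians(i_deg)) / n21
    if abs(sin_r) > 1.0:
        return None
    return math.degrees(math.asin(sin_r))


# Medium 1 = air, medium 2 = water, medium 3 = glass (assumed typical values).
n21 = 1.33          # water with respect to air
n31 = 1.5           # glass with respect to air
n12 = 1.0 / n21     # Eq. (9.11)
n32 = n31 * n12     # glass with respect to water

r = refraction_angle_deg(30.0, n31)           # air to glass: bends towards the normal
back = refraction_angle_deg(r, 1.0 / n31)     # glass to air: bends away from the normal
print(f"air to glass, i = 30 deg: r = {r:.2f} deg")
print(f"glass to air, i = {r:.2f} deg: r = {back:.2f} deg")
print(f"n32 (glass with respect to water) = {n32:.3f}")
```

Sending the refracted ray back through the interface with the reciprocal index recovers the original 30°, which is just Eq. (9.11) at work.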




Some elementary results based on the laws of refraction follow
immediately. For a rectangular slab, refraction takes place at two
interfaces (air-glass and glass-air). It is easily seen from Fig. 9.9 that
$r_2 = i_1$, i.e., the emergent ray is parallel to the incident ray; there is no
deviation, but it does suffer lateral displacement/shift with respect to the
incident ray. Another familiar observation is that the bottom of a tank filled
with water appears to be raised (Fig. 9.10). For viewing near the normal
direction, it can be shown that the apparent depth ($h_1$) is real depth ($h_2$)
divided by the refractive index of the medium (water).
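
The near-normal result $h_1 = h_2/n$ can be compared with a direct application of Snell's law. The sketch below is a simplified model, not the text's derivation: for oblique viewing it takes the apparent depth to be the point where the backward extension of a single emergent ray meets the vertical line through the object. The value n = 1.33 for water and the function name are assumptions.

```python
import math


def apparent_depth(real_depth, n, view_angle_deg=0.0):
    """Apparent depth of a point at real_depth below the surface of a medium
    of refractive index n (with respect to air), for a ray that emerges into
    air at view_angle_deg from the normal."""
    if view_angle_deg == 0.0:
        return real_depth / n                            # near-normal result h1 = h2 / n
    theta_air = math.radians(view_angle_deg)
    theta_medium = math.asin(math.sin(theta_air) / n)    # Snell's law at the surface
    # The ray leaves the surface at horizontal distance real_depth * tan(theta_medium)
    # from the object; extend the emergent ray backwards to the object's vertical.
    return real_depth * math.tan(theta_medium) / math.tan(theta_air)


for angle in (0.0, 1.0, 30.0):
    print(f"viewing angle {angle:4.1f} deg: apparent depth = "
          f"{apparent_depth(1.0, 1.33, angle):.4f} m")
```

For small viewing angles the ray-traced value agrees closely with $h_2/n$; at larger angles this simple model gives a smaller apparent depth, in line with the bottom of the tank appearing raised.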

The refraction of light through the atmosphere is responsible for many 
interesting phenomena. For example, the Sun is visible a little before the 
actual sunrise and until a little after the actual sunset due to refraction of 
light through the atmosphere (Fig. 9.11). By actual sunrise we mean the 
actual crossing of the horizon by the sun. Figure 9.11 shows the actual 
and apparent positions of the Sun with respect to the horizon. The figure 
is highly exaggerated to show the effect. The refractive index of air with 
respect to vacuum is 1.00029. Due to this, the apparent shift in the 
direction of the Sun is by about half a degree and the corresponding time 
difference between actual sunset and apparent sunset is about 2 minutes
(see Example 9.5). The apparent flattening (oval shape) of the Sun at
sunset and sunrise is also due to the same phenomenon. 
Figure 9.10 Apparent depth for (a) normal, and (b) oblique viewing.


Figure 9.11 Advance sunrise and delayed sunset due to atmospheric refraction.













Example 9.5 The earth takes 24 h to rotate once about its axis. How much time does the sun
take to shift by 1° when viewed from the earth?

Solution 

Time taken for 360° shift = 24 h 

Time taken for 1 ° shift = 24/360 h = 4 min. 
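The same proportion can be checked for any angle. The short sketch below is only an illustration of the arithmetic in this example; the function name shift_time_minutes is our own choice and is not part of the text.

```python
def shift_time_minutes(angle_deg, period_hours=24.0):
    """Time (in minutes) for the Sun's apparent direction to shift by angle_deg,
    assuming a uniform 360 degree shift in period_hours."""
    return angle_deg / 360.0 * period_hours * 60.0

print(shift_time_minutes(1.0))   # shift of one degree, as in Example 9.5
print(shift_time_minutes(0.5))   # shift of half a degree, the refraction shift quoted earlier
```

The half-degree case reproduces the time difference of about 2 minutes between actual and apparent sunset.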


The drowning child, lifeguard and Snell’s law 

Consider a rectangular swimming pool PQSR; see figure here. A lifeguard sitting at G outside 
the pool notices a child drowning at a point C. The guard wants to reach the child in the 
shortest possible time. Let SR be the side of the pool between G and C. Should he/she take 
a straight line path GAC between G and C or GBC in which the path BC in water would be 
the shortest, or some other path GXC? The guard knows that his/her running speed $v_1$ on
ground is higher than his/her swimming speed $v_2$.










Suppose the guard enters water at X. Let GX = $l_1$ and XC = $l_2$. Then
the time taken to reach from G to C would be

$t = \frac{l_1}{v_1} + \frac{l_2}{v_2}$


To make this time minimum, one has to differentiate it (with respect to 
the coordinate of X) and find the point X when t is a minimum. On 
doing all this algebra (which we skip here), we find that the guard 
should enter water at a point where Snell’s law is satisfied. To 
understand this, draw a perpendicular LM to side SR at X. Let ∠GXM
= i and ∠CXL = r. Then it can be seen that t is minimum when

$\frac{\sin i}{\sin r} = \frac{v_1}{v_2}$

In the case of light, $v_1/v_2$, the ratio of the velocity of light in vacuum to
that in the medium, is the refractive index n of the medium.

In short, whether it is a wave or a particle or a human being,
whenever two media and two velocities are involved, one must
follow Snell’s law if one wants to take the shortest time.
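The algebra skipped above can be checked numerically. The sketch below is a minimal illustration, not a derivation: it assumes the guard is a perpendicular distance a from the side SR, the child a perpendicular distance b from it, and that they are a distance d apart measured along SR. The numerical values and the function names are hypothetical. It searches for the entry point X that minimises the total time and then compares sin i / sin r with $v_1/v_2$.

```python
import math

def travel_time(x, a, b, d, v1, v2):
    """Total time G -> X -> C when the guard enters the water at coordinate x along SR.

    a: distance of G from the side SR, b: distance of C from SR,
    d: separation of G and C measured along SR (all assumed values).
    """
    return math.hypot(a, x) / v1 + math.hypot(b, d - x) / v2

def best_entry_point(a, b, d, v1, v2, steps=200):
    """Ternary search for the minimum of travel_time on 0 <= x <= d (the time is convex in x)."""
    lo, hi = 0.0, d
    for _ in range(steps):
        m1 = lo + (hi - lo) / 3.0
        m2 = hi - (hi - lo) / 3.0
        if travel_time(m1, a, b, d, v1, v2) < travel_time(m2, a, b, d, v1, v2):
            hi = m2
        else:
            lo = m1
    return 0.5 * (lo + hi)

# Illustrative numbers only: distances in metres, speeds in metres per second.
a, b, d, v1, v2 = 10.0, 8.0, 20.0, 6.0, 2.0
x = best_entry_point(a, b, d, v1, v2)
sin_i = x / math.hypot(a, x)              # i is measured from the normal LM at X
sin_r = (d - x) / math.hypot(b, d - x)    # r is measured from the same normal
print(sin_i / sin_r, v1 / v2)             # the two ratios agree at the fastest path
```

A direct search is used here instead of differentiation so that the result does not presuppose Snell's law.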


9.4 Total Internal Reflection 

When light travels from an optically denser medium to a rarer medium at 
the interface, it is partly reflected back into the same medium and partly 
refracted to the second medium. This reflection is called the internal 
reflection. 

When a ray of light enters from a denser medium to a rarer medium, it
bends away from the normal, for example, the ray $AO_1B$ in Fig. 9.12. The
incident ray $AO_1$ is partially reflected ($O_1C$) and partially transmitted
($O_1B$) or refracted, the angle of refraction (r) being larger than the angle
of incidence (i). As the angle of incidence increases, so does the angle of
refraction, till for the ray $AO_3$, the angle of refraction is π/2. The refracted
ray is bent so much away from the normal that it grazes the surface at
the interface between the two media. This is shown by the ray $AO_3D$ in
Fig. 9.12. If the angle of incidence is increased still further (e.g., the ray
$AO_4$), refraction is not possible, and the incident ray is totally reflected.

This is called total internal reflection. When light gets reflected by a 
surface, normally some fraction of it gets transmitted. The reflected ray, 
therefore, is always less intense than the incident ray, howsoever smooth 
the reflecting surface may be. In total internal reflection, on the other 
hand, no transmission of light takes place. 

The angle of incidence corresponding to an angle of refraction 90°, say
$\angle AO_3N$, is called the critical angle ($i_c$) for the given pair of media. We
see from Snell’s law [Eq. (9.10)] that if the relative refractive index is less
than one then, since the maximum value of sin r is unity, there is an
upper limit to the value of sin i for which the law can be satisfied, that is,
$i = i_c$ such that

$\sin i_c = n_{21}$ (9.12)

For values of i larger than $i_c$, Snell’s law of refraction cannot be satisfied,
and hence no refraction is possible.

The refractive index of denser medium 1 with respect to rarer medium 2
will be $n_{12} = 1/\sin i_c$. Some typical critical angles are listed in Table 9.1.




Figure 9.12 Refraction and internal reflection of rays from a point A in the denser medium (water) 
incident at different angles at the interface with a rarer medium (air). 


Table 9.1 Critical angle of some transparent media with respect to air

Substance medium     Refractive index     Critical angle
Water                1.33                 48.75°
Crown glass          1.52                 41.14°
Dense flint glass    1.62                 37.31°
Diamond              2.42                 24.41°
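Equation (9.12) can be applied directly to the refractive indices in the table. The following sketch evaluates it for three of the entries. The function name critical_angle_deg is our own, and air is assumed to have refractive index 1.

```python
import math

def critical_angle_deg(n_denser, n_rarer=1.0):
    """Critical angle i_c in degrees from sin i_c = n_rarer / n_denser, Eq. (9.12)."""
    if n_denser <= n_rarer:
        raise ValueError("total internal reflection needs n_denser > n_rarer")
    return math.degrees(math.asin(n_rarer / n_denser))

for name, n in [("Water", 1.33), ("Crown glass", 1.52), ("Diamond", 2.42)]:
    print(f"{name}: {critical_angle_deg(n):.2f} degrees")
```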


A demonstration for total internal reflection 

All optical phenomena can be demonstrated very easily with the use of a 
laser torch or pointer, which is easily available nowadays. Take a glass 
beaker with clear water in it. Add a few drops of milk or any other 
suspension to water and stir so that water becomes a little turbid. Take a 
laser pointer and shine its beam through the turbid water. You will find 
that the path of the beam inside the water shines brightly. 












Shine the beam from below the beaker such that it strikes at the upper 
water surface at the other end. Do you find that it undergoes partial 
reflection (which is seen as a spot on the table below) and partial 
refraction [which comes out in the air and is seen as a spot on the roof; 
Fig. 9.13(a)]? Now direct the laser beam from one side of the beaker 
such that it strikes the upper surface of water more obliquely [Fig. 
9.13(b)]. Adjust the direction of laser beam until you find the angle for 
which the refraction above the water surface is totally absent and the 
beam is totally reflected back to water. This is total internal reflection at 
its simplest. 

Pour this water in a long test tube and shine the laser light from top, as 
shown in Fig. 9.13(c). Adjust the direction of the laser beam such that it is 
totally internally reflected every time it strikes the walls of the tube. This is 
similar to what happens in optical fibres. 

Take care not to look into the laser beam directly and not to point it at 
anybody’s face. 

9.4.1 Total internal reflection in nature and its technological 
applications 

(i) Mirage: On hot summer days, the air near the ground becomes hotter 
than the air at higher levels. The refractive index of air increases with its 
density. Hotter air is less dense, and has smaller refractive index than the 
cooler air. If the air currents are small, that is, the air is still, the optical 
density at different layers of air increases with height. As a result, light 
from a tall object such as a tree, passes through a medium whose 
refractive index decreases towards the ground. Thus, a ray of light from 
such an object successively bends away from the normal and undergoes 
total internal reflection, if the angle of incidence for the air near the 


ground exceeds the critical angle. This is shown in Fig. 9.14(b). To a 
distant observer, the light appears to be coming from somewhere below
the ground. The observer naturally assumes that light is being reflected 
from the ground, say, by a pool of water near the tall object. Such 
inverted images of distant tall objects cause an optical illusion to the 
observer. This phenomenon is called mirage. This type of mirage is 
especially common in hot deserts. Some of you might have noticed that 
while moving in a bus or a car during a hot summer day, a distant patch 
of road, especially on a highway, appears to be wet. But, you do not find 
any evidence of wetness when you reach that spot. This is also due to 
mirage. 












Figure 9.13 Observing total internal reflection in water with a laser beam (refraction due to glass
of beaker neglected being very thin).


(ii) Diamond : Diamonds are known for their spectacular brilliance. Their 
brilliance is mainly due to the total internal reflection of light inside them. 
The critical angle for diamond-air interface (≅ 24.4°) is very small,
therefore once light enters a diamond, it is very likely to undergo total
internal reflection inside it. Diamonds found in nature rarely exhibit the
brilliance for which they are known. It is the technical skill of a diamond
cutter which makes diamonds sparkle so brilliantly. By cutting the
diamond suitably, multiple total internal reflections can be made to occur. 




Figure 9.14 (a) A tree is seen by an observer at its place when the air above the ground is at
uniform temperature, (b) When the layers of air close to the ground have varying temperature with
hottest layers near the ground, light from a distant tree may undergo total internal reflection, and
the apparent image of the tree may create an illusion to the observer that the tree is near a pool of
water.




Figure 9.15 Prisms designed to bend rays by 90° and 180° or to invert image without changing its
size make use of total internal reflection.

(iii) Prism : Prisms designed to bend light by 90° or by 180° make use of 
total internal reflection [Fig. 9.15(a) and (b)]. Such a prism is also used to 
invert images without changing their size [Fig. 9.15(c)]. 

In the first two cases, the critical angle $i_c$ for the material of the prism
must be less than 45°. We see from Table 9.1 that this is true for both
crown glass and dense flint glass.






















(iv) Optical fibres : Nowadays optical fibres are extensively used for 
transmitting audio and video signals through long distances. Optical 
fibres too make use of the phenomenon of total internal reflection. Optical 
fibres are fabricated with high quality composite glass/quartz fibres. Each 
fibre consists of a core and cladding. The refractive index of the material 
of the core is higher than that of the cladding. 

When a signal in the form of light is directed at one end of the fibre at a 
suitable angle, it undergoes repeated total internal reflections along the 
length of the fibre and finally comes out at the other end (Fig. 9.16). 
Since light undergoes total internal reflection at each stage, there is no 
appreciable loss in the intensity of the light signal. Optical fibres are 
fabricated such that light reflected at one side of inner surface strikes the 
other at an angle larger than the critical angle. Even if the fibre is bent, 
light can easily travel along its length. Thus, an optical fibre can be used 
to act as an optical pipe. 





Figure 9.16 Light undergoes successive total internal reflections as it moves through an optical
fibre.

A bundle of optical fibres can be put to several uses. Optical fibres are 
extensively used for transmitting and receiving electrical signals which 
are converted to light by suitable transducers. Obviously, optical fibres 
can also be used for transmission of optical signals. For example, these 
are used as a ‘light pipe’ to facilitate visual examination of internal organs
like esophagus, stomach and intestines. You might have seen a
commonly available decorative lamp with fine plastic fibres with their free
ends forming a fountain-like structure. The other end of the fibres is fixed
over an electric lamp. When the lamp is switched on, the light travels 
from the bottom of each fibre and appears at the tip of its free end as a 
dot of light. The fibres in such decorative lamps are optical fibres. 

The main requirement in fabricating optical fibres is that there should be 
very little absorption of light as it travels for long distances inside them. 
This has been achieved by purification and special preparation of 
materials such as quartz. In silica glass fibres, it is possible to transmit 
more than 95% of the light over a fibre length of 1 km. (Compare with 
what you expect for a block of ordinary window glass 1 km thick.) 
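To get a feel for the figure of 95% per kilometre, the sketch below estimates how much light survives over longer lengths. It assumes that every kilometre of fibre removes the same fraction of the light, and it also expresses the loss in decibels per kilometre, a unit commonly used for fibres but not introduced in this chapter. The function names are our own.

```python
import math

def loss_db_per_km(fraction_per_km):
    """Attenuation in dB/km for a fibre that transmits fraction_per_km of the light per km."""
    return -10.0 * math.log10(fraction_per_km)

def transmitted_fraction(length_km, fraction_per_km=0.95):
    """Fraction of light left after length_km, assuming the same loss in every kilometre."""
    return fraction_per_km ** length_km

print(f"{loss_db_per_km(0.95):.3f} dB/km")
print(f"{transmitted_fraction(10):.3f} of the light survives 10 km")
```

Under this assumption roughly 60% of the light is still present after 10 km.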

9.5 Refraction at Spherical Surfaces And By Lenses 

We have so far considered refraction at a plane interface. We shall now 
consider refraction at a spherical interface between two transparent 
media. An infinitesimal part of a spherical surface can be regarded as 
planar and the same laws of refraction can be applied at every point on 
the surface. Just as for reflection by a spherical mirror, the normal at the 
point of incidence is perpendicular to the tangent plane to the spherical 
surface at that point and, therefore, passes through its centre of 
curvature. We first consider refraction by a single spherical surface and 
follow it by thin lenses. A thin lens is a transparent optical medium 
bounded by two surfaces; at least one of which should be spherical. 
Applying the formula for image formation by a single spherical surface 
successively at the two surfaces of a lens, we shall obtain the lens 
maker’s formula and then the lens formula. 


9.5.1 Refraction at a spherical surface 

Figure 9.17 shows the geometry of formation of image I of an object O on
the principal axis of a spherical surface with centre of curvature C, and
radius of curvature R. The rays are incident from a medium of refractive
index $n_1$ to another of refractive index $n_2$. As before, we take the
aperture (or the lateral size) of the surface to be small compared to other
distances involved, so that small angle approximation can be made. In
particular, NM will be taken to be nearly equal to the length of the
perpendicular from the point N on the principal axis. We have, for small
angles,

$\tan \angle NOM = \frac{MN}{OM}$

$\tan \angle NCM = \frac{MN}{MC}$

$\tan \angle NIM = \frac{MN}{MI}$



Figure 9.17 Refraction at a spherical surface separating two media. 
Now, for ΔNOC, i is the exterior angle. Therefore, i = ∠NOM + ∠NCM

$i = \frac{MN}{OM} + \frac{MN}{MC}$ (9.13)

Similarly,

r = ∠NCM − ∠NIM


Light sources and photometry 

It is known that a body above absolute zero temperature emits electromagnetic radiation. The 
wavelength region in which the body emits the radiation depends on its absolute temperature. 
Radiation emitted by a hot body, for example, a tungsten filament lamp having temperature
2850 K, is partly invisible and lies mostly in the infrared (or heat) region. As the temperature of the
body increases, radiation emitted by it is in the visible region. The sun with temperature of about
5500 K emits radiation whose energy versus wavelength graph peaks approximately at 550 
nm corresponding to green light and is almost in the middle of the visible region. The energy 
versus wavelength distribution graph for a given body peaks at some wavelength, which is 
inversely proportional to the absolute temperature of that body. 

The measurement of light as perceived by human eye is called photometry. Photometry is 
measurement of a physiological phenomenon, being the stimulus of light as received by the 
human eye, transmitted by the optic nerves and analysed by the brain. The main physical 
quantities in photometry are (i) the luminous intensity of the source,
(ii) the luminous flux or flow of light from the source, and (iii) illuminance of the surface. The
SI unit of luminous intensity (I) is candela (cd). The candela is the luminous intensity, in a
given direction, of a source that emits monochromatic radiation of frequency
$540 \times 10^{12}$ Hz and that has a radiant intensity in that direction of 1/683 watt per steradian. If
a light source emits one candela of luminous intensity into a solid angle of one steradian, the
total luminous flux emitted into that solid angle is one lumen (lm). A standard 100 watt
incandescent light bulb emits approximately 1700 lumens.


In photometry, the only parameter, which can be measured directly is illuminance. It is defined
as luminous flux incident per unit area on a surface (lm/m² or lux). Most light meters measure
this quantity. The illuminance E, produced by a source of luminous intensity I, is given by
$E = I/r^2$, where r is the normal distance of the surface from the source. A quantity named
luminance (L), is used to characterise the brightness of emitting or reflecting flat surfaces. Its
unit is cd/m² (sometimes called ‘nit’ in industry). A good LCD computer monitor has a
brightness of about 250 nits.
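As a rough illustration of these quantities, the sketch below estimates the illuminance produced by the 100 watt bulb mentioned above. It assumes, purely for simplicity, that the bulb radiates its 1700 lumens equally in all directions (a full solid angle of 4π steradian) and that the surface faces the bulb. The distance of 2 m and the function names are hypothetical.

```python
import math

def isotropic_intensity_cd(flux_lm):
    """Luminous intensity (cd) of a source radiating flux_lm equally into 4*pi steradian."""
    return flux_lm / (4.0 * math.pi)

def illuminance_lux(intensity_cd, r_m):
    """Illuminance E = I / r**2 on a surface facing the source at normal distance r_m."""
    return intensity_cd / r_m ** 2

intensity = isotropic_intensity_cd(1700.0)   # the 100 W bulb quoted in the text
print(f"I = {intensity:.1f} cd, E at 2 m = {illuminance_lux(intensity, 2.0):.1f} lux")
```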


i.e., $r = \frac{MN}{MC} - \frac{MN}{MI}$ (9.14)


Now, by Snell’s law

$n_1 \sin i = n_2 \sin r$

or for small angles

$n_1 i = n_2 r$


Substituting i and r from Eqs. (9.13) and (9.14), we get

$\frac{n_1}{OM} + \frac{n_2}{MI} = \frac{n_2 - n_1}{MC}$ (9.15)


Here, OM, MI and MC represent magnitudes of distances. Applying the
Cartesian sign convention,

OM = −u, MI = +v, MC = +R

Substituting these in Eq. (9.15), we get

$\frac{n_2}{v} - \frac{n_1}{u} = \frac{n_2 - n_1}{R}$ (9.16)

Equation (9.16) gives us a relation between object and image distance in
terms of refractive index of the medium and the radius of curvature of the
curved spherical surface. It holds for any curved spherical surface.


Example 9.6 Light from a point source in air falls on a spherical glass surface (n = 1.5 and
radius of curvature = 20 cm). The distance of the light source from the glass surface is 100
cm. At what position is the image formed?

Solution

We use the relation given by Eq. (9.16). Here
u = −100 cm, v = ?, R = +20 cm, $n_1 = 1$, and $n_2 = 1.5$.

We then have

$\frac{1.5}{v} + \frac{1}{100} = \frac{0.5}{20}$

or v = +100 cm

The image is formed at a distance of 100 cm from the glass surface, in the direction of 
incident light. 
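The calculation in this example can be repeated for other values with a few lines of code. The sketch below simply solves Eq. (9.16) for v. The function name image_distance is our own, and all distances follow the Cartesian sign convention used in the text.

```python
def image_distance(u, R, n1, n2):
    """Solve n2/v - n1/u = (n2 - n1)/R, Eq. (9.16), for the image distance v."""
    return n2 / ((n2 - n1) / R + n1 / u)

v = image_distance(u=-100.0, R=20.0, n1=1.0, n2=1.5)   # data of Example 9.6, in cm
print(f"v = {v:+.1f} cm")
```

A positive v means that the image lies on the side towards which the incident light travels, as found in the example.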


9.5.2 Refraction by a lens 

Figure 9.18(a) shows the geometry of image formation by a double 
convex lens. The image formation can be seen in terms of two steps: 

(i) The first refracting surface forms the image $I_1$ of the object O
[Fig. 9.18(b)]. The image $I_1$ acts as a virtual object for the second surface
that forms the image at I [Fig. 9.18(c)]. Applying Eq. (9.15) to the first
interface ABC, we get

$\frac{n_1}{OB} + \frac{n_2}{BI_1} = \frac{n_2 - n_1}{BC_1}$ (9.17)

A similar procedure applied to the second interface* ADC gives,

$-\frac{n_2}{DI_1} + \frac{n_1}{DI} = \frac{n_2 - n_1}{DC_2}$ (9.18)

* Note that now the refractive index of the medium on the right side of ADC is $n_1$ while on its left it
is $n_2$. Further $DI_1$ is negative as the distance is measured against the direction of incident light.


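For a thin lens, $BI_1 = DI_1$. Adding
Eqs. (9.17) and (9.18), we get

$\frac{n_1}{OB} + \frac{n_1}{DI} = (n_2 - n_1)\left(\frac{1}{BC_1} + \frac{1}{DC_2}\right)$ (9.19)

Suppose the object is at infinity, i.e.,
OB → ∞ and DI = f, Eq. (9.19) gives

$\frac{n_1}{f} = (n_2 - n_1)\left(\frac{1}{BC_1} + \frac{1}{DC_2}\right)$ (9.20)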


The point where image of an object placed at infinity is formed is called 
the focus F, of the lens and the distance f gives its focal length. A lens 
has two foci, F and F', on either side of it (Fig. 9.19). By the sign 
convention, 
Figure 9.18 (a) The position of object, and the image formed by a double convex lens,
(b) Refraction at the first spherical surface and (c) Refraction at the second spherical surface.














$BC_1 = +R_1$,

$DC_2 = -R_2$

So Eq. (9.20) can be written as

$\frac{1}{f} = (n_{21} - 1)\left(\frac{1}{R_1} - \frac{1}{R_2}\right) \quad \left(\because n_{21} = \frac{n_2}{n_1}\right)$ (9.21)


Equation (9.21) is known as the lens maker’s formula. It is useful to
design lenses of desired focal length using surfaces of suitable radii of
curvature. Note that the formula is true for a concave lens also. In that
case $R_1$ is negative, $R_2$ positive and therefore, f is negative.


From Eqs. (9.19) and (9.20), we get

$\frac{n_1}{OB} + \frac{n_1}{DI} = \frac{n_1}{f}$ (9.22)


Again, in the thin lens approximation, B and D are both close to the
optical centre of the lens. Applying the sign convention,

BO = −u, DI = +v, we get

$\frac{1}{v} - \frac{1}{u} = \frac{1}{f}$ (9.23)

Equation (9.23) is the familiar thin lens formula. Though we derived it for 
a real image formed by a convex lens, the formula is valid for both 
convex as well as concave lenses and for both real and virtual images. 

It is worth mentioning that the two foci, F and F', of a double convex or 
concave lens are equidistant from the optical centre. The focus on the 
side of the (original) source of light is called the first focal point, whereas 
the other is called the second focal point. 

To find the image of an object by a lens, we can, in principle, take any
two rays emanating from a point on an object; trace their paths using the
laws of refraction and find the point where the refracted rays meet (or 
appear to meet). In practice, however, it is convenient to choose any two 
of the following rays: 

(i) A ray emanating from the object parallel to the principal axis of the lens 
after refraction passes through the second principal focus F' (in a convex 
lens) or appears to diverge (in a concave lens) from the first principal 
focus F. 

(ii) A ray of light, passing through the optical centre of the lens, emerges 
without any deviation after refraction. 

(iii) A ray of light passing through the first principal focus (for a convex 
lens) or appearing to meet at it (for a concave lens) emerges parallel to 
the principal axis after refraction. 



Figure 9.19 Tracing rays through (a) convex lens (b) concave lens.

Figures 9.19(a) and (b) illustrate these rules for a convex and a concave 
lens, respectively. You should practice drawing similar ray diagrams for 
different positions of the object with respect to the lens and also verify 
that the lens formula, Eq. (9.23), holds good for all cases. 

Here again it must be remembered that each point on an object gives out
an infinite number of rays. All these rays will pass through the same image
point after refraction at the lens. 

Magnification (m) produced by a lens is defined, like that for a mirror, as
the ratio of the size of the image to that of the object. Proceeding in the
same way as for spherical mirrors, it is easily seen that for a lens

$m = \frac{h'}{h} = \frac{v}{u}$ (9.24)

When we apply the sign convention, we see that, for erect (and virtual) 
image formed by a convex or concave lens, m is positive, while for an 
inverted (and real) image, m is negative. 
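The sign rules for m can be seen at work in a small numerical sketch. It uses the thin lens formula, Eq. (9.23), together with m = v/u. The convex lens of focal length +20 cm and the two object positions are illustrative values of our own, as is the function name.

```python
def image_and_magnification(u, f):
    """Return (v, m) from 1/v - 1/u = 1/f and m = v/u (Cartesian sign convention).

    The object must not be at the focus (u = -f), since the image is then at infinity.
    """
    v = 1.0 / (1.0 / f + 1.0 / u)
    return v, v / u

for u in (-30.0, -10.0):   # object beyond the focus, then between the focus and the lens
    v, m = image_and_magnification(u, 20.0)
    kind = "real, inverted" if m < 0 else "virtual, erect"
    print(f"u = {u} cm: v = {v:.1f} cm, m = {m:.1f} ({kind})")
```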


Example 9.7 A magician during a show makes a glass lens with n = 1.47 disappear in a 
trough of liquid. What is the refractive index of the liquid? Could the liquid be water? 

Solution 

The refractive index of the liquid must be equal to 1.47 in order to make the lens disappear.
This means $n_1 = n_2$. This gives 1/f = 0 or f → ∞. The lens in the liquid will act like a plane
sheet of glass. No, the liquid is not water. It could be glycerine.


9.5.3 Power of a lens 

Power of a lens is a measure of the convergence or divergence, which a 
lens introduces in the light falling on it. Clearly, a lens of shorter focal 
length bends the incident light more, while converging it in case of a 
convex lens and diverging it in case of a concave lens. The power P of a 
lens is defined as the tangent of the angle by which it converges or
diverges a beam of light falling at unit distance from the optical centre (Fig.
9.20).

$\tan \delta = \frac{h}{f}$; if h = 1, $\tan \delta = \frac{1}{f}$ or $\delta = \frac{1}{f}$ for small value of δ. Thus,

$P = \frac{1}{f}$ (9.25)

The SI unit for power of a lens is dioptre (D): 1 D = 1 m⁻¹. The power of a
lens of focal length of 1 metre is one dioptre. Power of a lens is positive 
for a converging lens and negative for a diverging lens. Thus, when an 
optician prescribes a corrective lens of power + 2.5 D, the required lens 
is a convex lens of focal length + 40 cm. A lens of power of - 4.0 D 
means a concave lens of focal length - 25 cm. 
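The conversions quoted above follow from P = 1/f with f in metres. A minimal sketch, with function names of our own choosing and focal lengths given in centimetres:

```python
def power_dioptre(f_cm):
    """Power in dioptre of a lens whose focal length is f_cm centimetres (P = 1/f, f in metres)."""
    return 100.0 / f_cm

def focal_length_cm(power_d):
    """Focal length in centimetres of a lens of power power_d dioptre."""
    return 100.0 / power_d

print(focal_length_cm(2.5))    # corrective lens of power +2.5 D
print(focal_length_cm(-4.0))   # lens of power -4.0 D
print(power_dioptre(50.0))     # lens of focal length 0.5 m
```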



Figure 9.20 Power of a lens. 

Example 9.8 (i) If f = 0.5 m for a glass lens, what is the power of the lens? (ii) The radii of 
curvature of the faces of a double convex lens are 10 cm and 15 cm. Its focal length is 12 cm. 
What is the refractive index of glass? (iii) A convex lens has 20 cm focal length in air. What is 
focal length in water? (Refractive index of air-water = 1.33, refractive index for air-glass = 
1.5.) 

Solution 

(i) Power = +2 dioptre. 

(ii) Here, we have f = +12 cm, $R_1$ = +10 cm, $R_2$ = −15 cm.

Refractive index of air is taken as unity.

We use the lens maker’s formula of Eq. (9.21). The sign convention has to be applied for f, $R_1$ and
$R_2$.

Substituting the values, we have

$\frac{1}{12} = (n - 1)\left(\frac{1}{10} - \frac{1}{-15}\right)$

This gives n = 1.5.


(iii) For a glass lens in air, $n_2$ = 1.5, $n_1$ = 1, f = +20 cm. Hence, the lens formula gives

$\frac{1}{20} = 0.5\left(\frac{1}{R_1} - \frac{1}{R_2}\right)$

For the same glass lens in water, $n_2$ = 1.5, $n_1$ = 1.33. Therefore,

$\frac{1.33}{f} = (1.5 - 1.33)\left(\frac{1}{R_1} - \frac{1}{R_2}\right)$ (9.26)

Combining these two equations, we find f = + 78.2 cm.
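Parts (ii) and (iii) can be checked with the lens maker's formula in the form $n_1/f = (n_2 - n_1)(1/R_1 - 1/R_2)$. In the sketch below the function name is our own. For part (iii) the radii are not given in the example, so a symmetric lens with $R_1$ = +20 cm and $R_2$ = −20 cm is assumed, since it has f = +20 cm in air; any other pair of radii with the same focal length in air leads to the same answer.

```python
def lens_focal_length(n_lens, n_medium, R1, R2):
    """Focal length from 1/f = (n21 - 1)(1/R1 - 1/R2), with n21 = n_lens / n_medium."""
    n21 = n_lens / n_medium
    return 1.0 / ((n21 - 1.0) * (1.0 / R1 - 1.0 / R2))

# Part (ii): the stated radii with n = 1.5 should give back f = 12 cm.
f_ii = lens_focal_length(1.5, 1.0, 10.0, -15.0)

# Part (iii): assumed symmetric lens, first in air and then in water.
f_air = lens_focal_length(1.5, 1.0, 20.0, -20.0)
f_water = lens_focal_length(1.5, 1.33, 20.0, -20.0)
print(f"{f_ii:.1f} cm, {f_air:.1f} cm, {f_water:.1f} cm")
```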


9.5.4 Combination of thin lenses in contact 

Consider two lenses A and B of focal length $f_1$ and $f_2$ placed in contact
with each other. Let the object be placed at a point O beyond the focus of
the first lens A (Fig. 9.21). The first lens produces an image at $I_1$. Since
image $I_1$ is real, it serves as a virtual object for the second lens B,
producing the final image at I. It must, however, be borne in mind that
formation of image by the first lens is presumed only to facilitate
determination of the position of the final image. In fact, the direction of
rays emerging from the first lens gets modified in accordance with the













angle at which they strike the second lens. Since the lenses are thin, we 
assume the optical centres of the lenses to be coincident. Let this central 
point be denoted by P. 

For the image formed by the first lens A, we get

$\frac{1}{v_1} - \frac{1}{u} = \frac{1}{f_1}$ (9.27)

For the image formed by the second lens B, we get

$\frac{1}{v} - \frac{1}{v_1} = \frac{1}{f_2}$ (9.28)

Figure 9.21 Image formation by a combination of two thin lenses in contact.


Adding Eqs. (9.27) and (9.28), we get

$\frac{1}{v} - \frac{1}{u} = \frac{1}{f_1} + \frac{1}{f_2}$ (9.29)


If the two lens-system is regarded as equivalent to a single lens of focal
length f, we have

$\frac{1}{v} - \frac{1}{u} = \frac{1}{f}$

so that we get

$\frac{1}{f} = \frac{1}{f_1} + \frac{1}{f_2}$ (9.30)








The derivation is valid for any number of thin lenses in contact. If several
thin lenses of focal length $f_1, f_2, f_3, \ldots$ are in contact, the effective focal
length of their combination is given by

$\frac{1}{f} = \frac{1}{f_1} + \frac{1}{f_2} + \frac{1}{f_3} + \ldots$ (9.31)


In terms of power, Eq. (9.31) can be written as

$P = P_1 + P_2 + P_3 + \ldots$ (9.32)


where P is the net power of the lens combination. Note that the sum in 
Eq. (9.32) is an algebraic sum of individual powers, so some of the terms 
on the right side may be positive (for convex lenses) and some negative 
(for concave lenses). Combination of lenses helps to obtain diverging or 
converging lenses of desired magnification. It also enhances sharpness 
of the image. Since the image formed by the first lens becomes the
object for the second, Eq. (9.24) implies that the total magnification m of
the combination is a product of magnification ($m_1, m_2, m_3, \ldots$) of individual
lenses

$m = m_1 m_2 m_3 \ldots$ (9.33)


Such a system of combination of lenses is commonly used in designing 
lenses for cameras, microscopes, telescopes and other optical 
instruments. 
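The algebraic sum of powers is easy to illustrate. The sketch below combines thin lenses in contact by adding their powers. The pair chosen, a convex lens of focal length +10 cm with a concave lens of focal length −25 cm, is a hypothetical example, and the function name is our own.

```python
import math

def combined_focal_length_cm(focal_lengths_cm):
    """Effective focal length (cm) of thin lenses in contact, using P = P1 + P2 + ..."""
    total_power = sum(100.0 / f for f in focal_lengths_cm)   # powers in dioptre
    if total_power == 0.0:
        return math.inf          # the powers cancel; the combination does not focus
    return 100.0 / total_power

f = combined_focal_length_cm([10.0, -25.0])   # powers of +10 D and -4 D
print(f"net power = {100.0 / f:.1f} D, f = {f:.2f} cm")
```

The concave lens weakens the combination, which still converges because the net power remains positive.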


Example 9.9 Find the position of the image formed by the lens combination given in the Fig. 
9.22. 




Figure 9.22 (three lenses with f = +10, −10 and +30 cm; the object O is 30 cm from the first lens; the lens separations are 5 cm and 10 cm)

Solution Image formed by the first lens

$\frac{1}{v_1} - \frac{1}{u_1} = \frac{1}{f_1}$

$\frac{1}{v_1} - \frac{1}{-30} = \frac{1}{10}$

or $v_1$ = 15 cm

The image formed by the first lens serves as the object for the 
second. This is at a distance of (15 - 5) cm = 10 cm to the right of the 
second lens. Though the image is real, it serves as a virtual object for 
the second lens, which means that the rays appear to come from it for 
the second lens. 

$\frac{1}{v_2} - \frac{1}{10} = \frac{1}{-10}$

or $v_2 = \infty$

The virtual image is formed at an infinite distance to the left of the 
second lens. This acts as an object for the third lens. 









$\frac{1}{v_3} - \frac{1}{u_3} = \frac{1}{f_3}$

or $\frac{1}{v_3} = \frac{1}{\infty} + \frac{1}{30}$

or $v_3$ = 30 cm

The final image is formed 30 cm to the right of the third lens. 
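The step-by-step procedure of this example, in which the image formed by one lens becomes the object for the next, can be written as a short loop. The sketch below is one possible way to organise the calculation. The function names are our own, and the lens data are those read from the example (focal lengths +10, −10 and +30 cm, separations 5 cm and 10 cm).

```python
import math

def thin_lens_image(u, f):
    """Image distance v from 1/v - 1/u = 1/f; returns math.inf for an image at infinity."""
    inv_v = 1.0 / f + 1.0 / u
    return math.inf if inv_v == 0.0 else 1.0 / inv_v

def trace(u_first, lenses):
    """lenses: list of (focal_length, gap_to_next_lens). Returns the image distance from each lens."""
    images, u = [], u_first
    for f, gap in lenses:
        v = thin_lens_image(u, f)
        images.append(v)
        u = v - gap      # the same image, now measured from the next lens
    return images

v1, v2, v3 = trace(-30.0, [(10.0, 5.0), (-10.0, 10.0), (30.0, 0.0)])
# Round-off may give v2 as a very large finite number instead of infinity;
# either way the rays reaching the third lens are parallel to high accuracy.
print(v1, v2, v3)
```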


9.6 Refraction through a Prism 

Figure 9.23 shows the passage of light through a triangular prism ABC. 
The angles of incidence and refraction at the first face AB are i and $r_1$,
while the angle of incidence (from glass to air) at the second face AC is $r_2$
and the angle of refraction or emergence e. The angle between the
emergent ray RS and the direction of the incident ray PQ is called the
angle of deviation, δ.

In the quadrilateral AQNR, two of the angles (at the vertices Q and R) are 
right angles. Therefore, the sum of the other angles of the quadrilateral is 
180°. 

∠A + ∠QNR = 180°

From the triangle QNR,

$r_1 + r_2$ + ∠QNR = 180°

Comparing these two equations, we get

$r_1 + r_2 = A$ (9.34)

The total deviation δ is the sum of deviations at the two faces,

$\delta = (i - r_1) + (e - r_2)$

that is,

$\delta = i + e - A$ (9.35)



Figure 9.23 A ray of light passing through a triangular glass prism. 

Thus, the angle of deviation depends on the angle of incidence. A plot
between the angle of deviation and angle of incidence is shown in Fig.
9.24. You can see that, in general, any given value of δ, except for i = e,
corresponds to two values i and hence of e. This, in fact, is expected
from the symmetry of i and e in Eq. (9.35), i.e., δ remains the same if i
and e are interchanged. Physically, this is related to the fact that the path
of ray in Fig. 9.23 can be traced back, resulting in the same angle of
deviation. At the minimum deviation $D_m$, the refracted ray inside the prism
becomes parallel to its base. We have

δ = $D_m$, i = e which implies $r_1 = r_2$.

Equation (9.34) gives

$2r = A$ or $r = \frac{A}{2}$ (9.36)

In the same way, Eq. (9.35) gives

$D_m = 2i - A$, or $i = (A + D_m)/2$ (9.37)

The refractive index of the prism is

$n_{21} = \frac{n_2}{n_1} = \frac{\sin[(A + D_m)/2]}{\sin[A/2]}$ (9.38)

The angles A and $D_m$ can be measured experimentally. Equation (9.38)
thus provides a method of determining refractive index of the material of
the prism.

For a small angle prism, i.e., a thin prism, $D_m$ is also very small, and we
get

$n_{21} = \frac{\sin[(A + D_m)/2]}{\sin[A/2]} \simeq \frac{(A + D_m)/2}{A/2}$

$D_m = (n_{21} - 1)A$

It implies that thin prisms do not deviate light much.
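The relations of this section can be tried out numerically. The sketch below traces a ray through both faces using Snell's law, $r_1 + r_2 = A$ and δ = i + e − A, locates the minimum deviation for an illustrative prism (A = 60°, n = 1.5, values of our own choosing), and then recovers the refractive index from A and $D_m$. The function names are hypothetical.

```python
import math

def deviation_deg(i_deg, A_deg, n21):
    """Deviation delta = i + e - A for a ray traced through both faces of the prism."""
    r1 = math.asin(math.sin(math.radians(i_deg)) / n21)   # refraction at the first face
    r2 = math.radians(A_deg) - r1                         # from r1 + r2 = A
    s = n21 * math.sin(r2)
    if s > 1.0:
        raise ValueError("ray is totally internally reflected at the second face")
    e = math.degrees(math.asin(s))                        # angle of emergence
    return i_deg + e - A_deg

def index_from_min_deviation(A_deg, Dm_deg):
    """n21 = sin[(A + Dm)/2] / sin[A/2]."""
    return math.sin(math.radians((A_deg + Dm_deg) / 2.0)) / math.sin(math.radians(A_deg / 2.0))

A, n = 60.0, 1.5                                                        # illustrative values
i_min = math.degrees(math.asin(n * math.sin(math.radians(A / 2.0))))   # i when r1 = r2 = A/2
Dm = deviation_deg(i_min, A, n)
print(f"Dm = {Dm:.2f} deg, recovered n = {index_from_min_deviation(A, Dm):.4f}")
print(f"thin prism with A = 4 deg: Dm = {(n - 1.0) * 4.0:.1f} deg")
```

Evaluating deviation_deg at angles of incidence on either side of i_min gives larger deviations, in line with the shape of the curve described in the text.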














Figure 9.24 Plot of angle of deviation (δ) versus angle of incidence (i) for a triangular prism.


9.7 Some Natural Phenomena due to Sunlight 

The interplay of light with things around us gives rise to several beautiful
phenomena. The spectacle of colours that we see around us all the time
is possible due to sunlight.

While studying dispersion of visible (or white) light by a prism (Class X)
and the electromagnetic spectrum (Chapter 8, Class XII), we got to know
that colour is associated with the frequency of light or the wavelength of
light in the given medium. In the visible spectrum, red light is at the long
wavelength end (~700 nm) while the violet light is at the short wavelength
end (~400 nm). Dispersion takes place because the refractive index of
medium for different frequencies (colours) is different. For example, the 
bending of red component of white light is least while it is most for the 
violet. Equivalently, red light travels faster than violet light in a glass 
prism. Table 9.2 gives the refractive indices for different wavelength for 
crown glass and flint glass. Thick lenses could be assumed as made of 
many prisms, therefore, thick lenses show chromatic aberration due to 
dispersion of light. When white light passes through thick lenses, red and 
blue colours focus at different points. This phenomenon is known as 
chromatic aberration. 


Table 9.2 Refractive indices for different wavelengths

Colour     Wavelength (nm)     Crown glass     Flint glass
Violet     396.9               1.533           1.663
Blue       486.1               1.523           1.639
Yellow     589.3               1.517           1.627
Red        656.3               1.515           1.622
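The size of the effect can be estimated by combining the table with the thin prism result $D_m = (n_{21} - 1)A$ from the previous section. The sketch below computes the spread between the deviations of violet and red light. The prism angle of 5° is a hypothetical choice, and the names used are our own.

```python
# Refractive indices copied from Table 9.2: colour -> (crown glass, flint glass)
INDICES = {
    "Violet": (1.533, 1.663),
    "Blue": (1.523, 1.639),
    "Yellow": (1.517, 1.627),
    "Red": (1.515, 1.622),
}

def thin_prism_deviation(n, A_deg):
    """Deviation (degrees) of a thin prism of angle A_deg, using D = (n - 1) A."""
    return (n - 1.0) * A_deg

def angular_dispersion(glass, A_deg):
    """Violet deviation minus red deviation; glass is 0 for crown glass, 1 for flint glass."""
    violet = thin_prism_deviation(INDICES["Violet"][glass], A_deg)
    red = thin_prism_deviation(INDICES["Red"][glass], A_deg)
    return violet - red

for glass, name in enumerate(("Crown glass", "Flint glass")):
    print(f"{name}: {angular_dispersion(glass, 5.0):.3f} degrees between violet and red")
```

For the same prism angle, flint glass spreads the colours more than twice as much as crown glass.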


The variation of refractive index with wavelength may be more 
pronounced in some media than the other. In vacuum, of course, the 
speed of light is independent of wavelength. Thus, vacuum (or air
approximately) is a non-dispersive medium in which all colours travel with
the same speed. This also follows from the fact that sunlight reaches us 
in the form of white light and not as its components. On the other hand, 
glass is a dispersive medium. 
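The statement that red light travels faster than violet light in glass can be illustrated with the entries of Table 9.2. The sketch below assumes the usual relation v = c/n between the speed in a medium and its refractive index, and uses a rounded value for the speed of light in vacuum.

```python
C_VACUUM = 2.998e8  # speed of light in vacuum in m/s (rounded)

# (colour, wavelength in nm, n for crown glass, n for flint glass), from Table 9.2
TABLE_9_2 = [
    ("Violet", 396.9, 1.533, 1.663),
    ("Blue",   486.1, 1.523, 1.639),
    ("Yellow", 589.3, 1.517, 1.627),
    ("Red",    656.3, 1.515, 1.622),
]

def speed_in_medium(n):
    """Speed of light in a medium of refractive index n, using v = c / n."""
    return C_VACUUM / n

for colour, wavelength_nm, n_crown, n_flint in TABLE_9_2:
    print(f"{colour:7s} crown: {speed_in_medium(n_crown):.4e} m/s   "
          f"flint: {speed_in_medium(n_flint):.4e} m/s")
```

In both glasses the speed rises steadily from violet to red, and the spread is larger in flint glass, which is the more dispersive of the two.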

The blue of the sky, white clouds, the red-hue at sunrise and sunset, the 
rainbow, the brilliant colours of some pearls, shells, and wings of birds, 
are just a few of the natural wonders we are used to. We describe some 
of them here from the point of view of physics. 

9.7.1 The rainbow 

The rainbow is an example of the dispersion of sunlight by the water
drops in the atmosphere. This is a phenomenon due to the combined effect
of dispersion, refraction and reflection of sunlight by spherical water
droplets of rain. The conditions for observing a rainbow are that the Sun
should be shining in one part of the sky (say near the western horizon) while
it is raining in the opposite part of the sky (say the eastern horizon). An
observer can therefore see a rainbow only when his back is towards the
Sun.

In order to understand the formation of rainbows, consider Fig. 9.25(a).
Sunlight is first refracted as it enters a raindrop, which causes the
different wavelengths (colours) of white light to separate. Longer
wavelengths of light (red) are bent the least while the shorter wavelengths
(violet) are bent the most. Next, these component rays strike the inner
surface of the water drop and get internally reflected if the angle between
the refracted ray and the normal to the drop surface is greater than the
critical angle (48° in this case). The reflected light is refracted again when
it comes out of the drop, as shown in the figure. It is found that the violet
light emerges at an angle of 40° relative to the incoming sunlight and red
light emerges at an angle of 42°. For other colours, angles lie in between
these two values.
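As a rough check on the quoted angles of about 40° and 42°, the sketch below follows a ray through a spherical drop with one internal reflection. For angle of incidence i and angle of refraction r, the geometry gives an angle of 4r - 2i between the incoming and the emerging ray, and the emerging light is concentrated near the largest value of this angle. The refractive indices of water used here (1.331 for red, 1.344 for violet) are assumed typical values, not figures given in the text.

```python
import math

def rainbow_angle_deg(n, steps=9000):
    """Largest angle (in degrees) between incoming sunlight and the ray that
    leaves a spherical drop after refraction, one reflection and refraction."""
    best = 0.0
    for k in range(1, steps):
        i = math.radians(90.0 * k / steps)   # angle of incidence
        r = math.asin(math.sin(i) / n)       # Snell's law at entry
        best = max(best, math.degrees(4 * r - 2 * i))
    return best

N_WATER_RED = 1.331     # assumed refractive index of water for red light
N_WATER_VIOLET = 1.344  # assumed refractive index of water for violet light

print(f"red:    {rainbow_angle_deg(N_WATER_RED):.1f} deg")
print(f"violet: {rainbow_angle_deg(N_WATER_VIOLET):.1f} deg")
```

With these assumed indices the red ray comes out at a slightly larger angle than the violet ray, in line with the ordering described above.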


Figure 9.25(b) explains the formation of the primary rainbow. We see that red
light from drop 1 and violet light from drop 2 reach the observer’s eye.
The violet from drop 1 and red light from drop 2 are directed at levels
above or below the observer. Thus the observer sees a rainbow with red
colour on the top and violet on the bottom. The primary rainbow is a
result of a three-step process, that is, refraction, reflection and refraction.

When light rays undergo two internal reflections inside a raindrop,
instead of one as in the primary rainbow, a secondary rainbow is formed
as shown in Fig. 9.25(c). It is due to a four-step process. The intensity of
light is reduced at the second reflection and hence the secondary
rainbow is fainter than the primary rainbow. Further, the order of the
colours is reversed in it as is clear from Fig. 9.25(c).

Figure 9.25 Rainbow: (a) The sun rays incident on a water drop get refracted twice and
reflected internally by a drop; (b) enlarged view of internal reflection and refraction of a ray of light
inside a drop forming primary rainbow; and (c) secondary rainbow is formed by rays undergoing
internal reflection twice inside the drop.


9.7.2 Scattering of light 

As sunlight travels through the earth’s atmosphere, it gets scattered
(changes its direction) by the atmospheric particles. Light of shorter
wavelengths is scattered much more than light of longer wavelengths.
(The amount of scattering is inversely proportional to the fourth power of
the wavelength. This is known as Rayleigh scattering.) Hence, the bluish
colour predominates in a clear sky, since blue has a shorter wavelength
than red and is scattered much more strongly. In fact, violet gets
scattered even more than blue, having a shorter wavelength. But since
our eyes are more sensitive to blue than violet, we see the sky blue.
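The strength of the inverse-fourth-power dependence is easy to underestimate. As a small sketch, the function below compares the scattering of the two ends of the visible spectrum quoted earlier in this section (about 400 nm and about 700 nm), assuming only that the amount of scattering is proportional to the inverse fourth power of the wavelength.

```python
def rayleigh_ratio(short_nm, long_nm):
    """How many times more strongly the shorter wavelength is scattered,
    assuming scattering proportional to 1 / wavelength**4."""
    return (long_nm / short_nm) ** 4

# Approximate ends of the visible spectrum: violet ~400 nm, red ~700 nm
print(rayleigh_ratio(400.0, 700.0))
```

Light at the short-wavelength end is thus scattered roughly nine times more strongly than light at the long-wavelength end.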

Large particles like dust and water droplets present in the atmosphere
behave differently. The relevant quantity here is the relative size of the
wavelength of light $\lambda$ and the scatterer (of typical size, say, a). For $a \ll \lambda$,
one has Rayleigh scattering, which is proportional to $1/\lambda^4$. For $a \gg \lambda$,
i.e., large scattering objects (for example, raindrops, large dust or ice
particles), this is not true; all wavelengths are scattered nearly equally.
Thus, clouds, which have droplets of water with $a \gg \lambda$, are generally
white.

Figure 9.26 Sunlight travels through a longer distance in the atmosphere at sunset and sunrise.

At sunset or sunrise, the sun’s rays have to pass through a larger
distance in the atmosphere (Fig. 9.26). Most of the blue and other shorter
wavelengths are removed by scattering. The least scattered light, red,
reaches our eyes; therefore, the sun looks reddish. This explains the
reddish appearance of the sun and full moon near the horizon.







9.8 Optical Instruments 

A number of optical devices and instruments have been designed utilising
the reflecting and refracting properties of mirrors, lenses and prisms.
Periscopes, kaleidoscopes, binoculars, telescopes and microscopes are some
examples of optical devices and instruments that are in common use.
Our eye is, of course, one of the most important optical devices that nature
has endowed us with. We have already studied the human eye in
Class X. We now go on to describe the principles of working of the
microscope and the telescope.

9.8.1 The microscope 

A simple magnifier or microscope is a converging lens of small focal 
length (Fig. 9.27). In order to use such a lens as a microscope, the lens is 
held near the object, one focal length away or less, and the eye is 
positioned close to the lens on the other side. The idea is to get an erect, 
magnified and virtual image of the object at a distance so that it can be 
viewed comfortably, i.e., at 25 cm or more. If the object is at a distance f, 
the image is at infinity. However, if the object is at a distance slightly less 
than the focal length of the lens, the image is virtual and closer than 
infinity. Although the closest comfortable distance for viewing the image 
is when it is at the near point (distance D = 25 cm), it causes some strain 
on the eye. Therefore, the image formed at infinity is often considered 
most suitable for viewing by the relaxed eye. We show both cases, the 
first in Fig. 9.27(a), and the second in Fig. 9.27(b) and (c). 



(b) 



(c) 

Figure 9.27 A simple microscope: (a) the magnifying lens is located such that the image is at the
near point, (b) the angle subtended by the object is the same as that at the near point, and (c) the
object near the focal point of the lens; the image is far off but closer than infinity.

The linear magnification m, for the image formed at the near point D, by a
simple microscope can be obtained by using the relation

$$m = \frac{v}{u} = v\left(\frac{1}{v} - \frac{1}{f}\right) = 1 - \frac{v}{f}$$

Now according to our sign convention, v is negative, and is equal in
magnitude to D. Thus, the magnification is

$$m = 1 + \frac{D}{f} \qquad (9.39)$$

Since D is about 25 cm, to have a magnification of six, one needs a
convex lens of focal length f = 5 cm.

Note that m = h'/h where h is the size of the object and h' the size of the 
image. This is also the ratio of the angle subtended by the image 
to that subtended by the object, if placed at D for comfortable viewing. 
(Note that this is not the angle actually subtended by the object at the 
eye, which is h/u.) What a single-lens simple magnifier achieves is that it 
allows the object to be brought closer to the eye than D. 


We will now find the magnification when the image is at infinity. In this
case we will have to obtain the angular magnification. Suppose the
object has a height h. The maximum angle it can subtend, and be clearly
visible (without a lens), is when it is at the near point, i.e., a distance D.
The angle subtended is then given by

$$\tan\theta_o = \frac{h}{D} \approx \theta_o \qquad (9.40)$$

We now find the angle subtended at the eye by the image when the
object is at u. From the relations

$$\frac{h'}{h} = m = \frac{v}{u}$$

we have the angle subtended by the image

$$\tan\theta_i = \frac{h'}{-v} = \frac{h}{-v}\cdot\frac{v}{u} = \frac{h}{-u} \approx \theta_i$$

The angle subtended by the object, when it is at u = -f, is

$$\theta_i = \frac{h}{f} \qquad (9.41)$$

as is clear from Fig. 9.27(c). The angular magnification is, therefore,

$$m = \frac{\theta_i}{\theta_o} = \frac{D}{f} \qquad (9.42)$$


This is one less than the magnification when the image is at the near 
point, Eq. (9.39), but the viewing is more comfortable and the difference 
in magnification is usually small. In subsequent discussions of optical 
instruments (microscope and telescope) we shall assume the image to 
be at infinity. 
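The two cases for the simple microscope can be compared numerically. The sketch below takes the near-point result as m = 1 + D/f and the image-at-infinity result as m = D/f, with D = 25 cm as in the text; the function names are illustrative only.

```python
D_NEAR_POINT_CM = 25.0  # least distance of distinct vision, as used in the text

def magnification_near_point(f_cm, d_cm=D_NEAR_POINT_CM):
    """Magnification of a simple microscope with the image at the near point."""
    return 1.0 + d_cm / f_cm

def magnification_at_infinity(f_cm, d_cm=D_NEAR_POINT_CM):
    """Angular magnification of a simple microscope with the image at infinity."""
    return d_cm / f_cm

for f_cm in (2.5, 5.0, 10.0):
    print(f"f = {f_cm:4.1f} cm  near point: {magnification_near_point(f_cm):4.1f}"
          f"  infinity: {magnification_at_infinity(f_cm):4.1f}")
```

For f = 5 cm this gives 6 and 5 respectively, so the two values always differ by exactly one, whatever the focal length.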

A simple microscope has a limited maximum magnification (< 9) for 
realistic focal lengths. For much larger magnifications, one uses two 
lenses, one compounding the effect of the other. This is known as a 
compound microscope. A schematic diagram of a compound microscope 
is shown in Fig. 9.28. The lens nearest the object, called the objective,
forms a real, inverted, magnified image of the object. This serves as the
object for the second lens, the eyepiece, which functions essentially like
a simple microscope or magnifier and produces the final image, which is
enlarged and virtual. The first inverted image is thus near (at or within)
the focal plane of the eyepiece, at a distance appropriate for final image 
formation at infinity, or a little closer for image formation at the near point. 
Clearly, the final image is inverted with respect to the original object. 






Figure 9.28 Ray diagram for the formation of image by a compound microscope. 

We now obtain the magnification due to a compound microscope. The
ray diagram of Fig. 9.28 shows that the (linear) magnification due to the
objective, namely h'/h, equals

$$m_o = \frac{h'}{h} = \frac{L}{f_o} \qquad (9.43)$$

where we have used the result

$$\tan\beta = \frac{h}{f_o} = \frac{h'}{L}$$

Here h' is the size of the first image, the object size being h and $f_o$ being
the focal length of the objective. The first image is formed near the focal
point of the eyepiece. The distance L, i.e., the distance between the
second focal point of the objective and the first focal point of the eyepiece
(focal length $f_e$) is called the tube length of the compound microscope.













As the first inverted image is near the focal point of the eyepiece, we use
the result from the discussion above for the simple microscope to obtain
the (angular) magnification $m_e$ due to it [Eq. (9.39)]. When the final image
is formed at the near point, it is

$$m_e = 1 + \frac{D}{f_e} \qquad [9.44(a)]$$

When the final image is formed at infinity, the angular magnification due
to the eyepiece [Eq. (9.42)] is

$$m_e = \frac{D}{f_e} \qquad [9.44(b)]$$

Thus, the total magnification [according to Eq. (9.33)], when the image is
formed at infinity, is

$$m = m_o m_e = \frac{L}{f_o}\cdot\frac{D}{f_e} \qquad (9.45)$$

Clearly, to achieve a large magnification of a small object (hence the
name microscope), the objective and eyepiece should have small focal
lengths. In practice, it is difficult to make the focal length much smaller
than 1 cm. Also large lenses are required to make L large.


For example, with an objective with $f_o$ = 1.0 cm, and an eyepiece with
focal length $f_e$ = 2.0 cm, and a tube length of 20 cm, the magnification is

$$m = m_o m_e = \frac{L}{f_o}\cdot\frac{D}{f_e} = \frac{20}{1}\times\frac{25}{2} = 250$$

Various other factors, such as illumination of the object, contribute to the
quality and visibility of the image. In modern microscopes, multi-component
lenses are used for both the objective and the eyepiece to
improve image quality by minimising various optical aberrations (defects)
in lenses.
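The worked example for the compound microscope (objective of focal length 1.0 cm, eyepiece of focal length 2.0 cm, tube length 20 cm) can be checked with a few lines of code. This is a minimal sketch that assumes the final image is at infinity, so that the total magnification is the product of L/f_o for the objective and D/f_e for the eyepiece, with D = 25 cm; the function name is illustrative only.

```python
def compound_magnification(tube_length_cm, f_objective_cm, f_eyepiece_cm,
                           near_point_cm=25.0):
    """Total magnification of a compound microscope, final image at infinity."""
    m_objective = tube_length_cm / f_objective_cm  # linear magnification L / f_o
    m_eyepiece = near_point_cm / f_eyepiece_cm     # angular magnification D / f_e
    return m_objective * m_eyepiece

print(compound_magnification(20.0, 1.0, 2.0))  # 250.0
```

Halving either focal length doubles the result, which is why both lenses of a compound microscope are given short focal lengths.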

9.8.2 Telescope 

The telescope is used to provide angular magnification of distant objects
(Fig. 9.29). It also has an objective and an eyepiece. But here, the
objective has a large focal length and a much larger aperture than the
eyepiece. Light from a distant object enters the objective and a real
image is formed in the tube at its second focal point. The eyepiece
magnifies this image producing a final inverted image. The magnifying
power m is the ratio of the angle $\beta$ subtended at the eye by the final
image to the angle $\alpha$ which the object subtends at the lens or the eye.
Hence

$$m \approx \frac{\beta}{\alpha} \approx \frac{h}{f_e}\cdot\frac{f_o}{h} = \frac{f_o}{f_e} \qquad (9.46)$$

In this case, the length of the telescope tube is $f_o + f_e$.

Terrestrial telescopes have, in addition, a pair of inverting lenses to make
the final image erect. Refracting telescopes can be used both for
terrestrial and astronomical observations. For example, consider a
telescope whose objective has a focal length of 100 cm and the eyepiece
a focal length of 1 cm. The magnifying power of this telescope is m =
100/1 = 100.

Let us consider a pair of stars of actual separation 1′ (one minute of arc).
The stars appear as though they are separated by an angle of 100 × 1′ =
100′ = 1.67°.
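The telescope example can be reproduced as follows. The sketch assumes normal adjustment, so that the magnifying power is the ratio of the focal length of the objective to that of the eyepiece, and converts the apparent separation from minutes of arc to degrees; the function names are illustrative only.

```python
def telescope_magnifying_power(f_objective_cm, f_eyepiece_cm):
    """Magnifying power of a telescope in normal adjustment: m = f_o / f_e."""
    return f_objective_cm / f_eyepiece_cm

def arcmin_to_degrees(arcmin):
    """Convert an angle from minutes of arc to degrees."""
    return arcmin / 60.0

m = telescope_magnifying_power(100.0, 1.0)
apparent_separation_arcmin = m * 1.0  # two stars 1 minute of arc apart
print(m, round(arcmin_to_degrees(apparent_separation_arcmin), 2))  # 100.0 1.67
```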





The main considerations with an astronomical telescope are its light 
gathering power and its resolution or resolving power. The former clearly 
depends on the area of the objective. With larger diameters, fainter 
objects can be observed. The resolving power, or the ability to observe 
two objects distinctly, which are in very nearly the same direction, also 
depends on the diameter of the objective. So, the desirable aim in optical 
telescopes is to make them with objective of large diameter. The largest 
lens objective in use has a diameter of 40 inch (~1.02 m). It is at the
Yerkes Observatory in Wisconsin, USA. Such big lenses tend to be very 
heavy and therefore, difficult to make and support by their edges. Further, 
it is rather difficult and expensive to make such large sized lenses which 
form images that are free from any kind of chromatic aberration and 
distortions. 




Figure 9.29 A refracting telescope. 

For these reasons, modern telescopes use a concave mirror rather than
a lens for the objective. Telescopes with mirror objectives are called
reflecting telescopes. There is no chromatic aberration in a mirror.
Mechanical support is much less of a problem since a mirror weighs
much less than a lens of equivalent optical quality, and can be supported
over its entire back surface, not just over its rim. One obvious problem
with a reflecting telescope is that the objective mirror focusses light inside
the telescope tube. One must have an eyepiece and the observer right
there, obstructing some light (depending on the size of the observer
cage). This is what is done in the very large 200 inch (~5.08 m)
diameter Mt. Palomar telescope, California. The viewer sits near the
focal point of the mirror, in a small cage. Another solution to the problem
is to deflect the light being focussed by another mirror. One such
arrangement using a convex secondary mirror to focus the incident light,
which now passes through a hole in the objective primary mirror, is
shown in Fig. 9.30. This is known as a Cassegrain telescope, after its
inventor. It has the advantage of a large focal length in a short
telescope. The largest telescope in India is in Kavalur, Tamil Nadu. It is a
2.34 m diameter reflecting telescope (Cassegrain). It was ground,
polished, set up, and is being used by the Indian Institute of Astrophysics,
Bangalore. The largest reflecting telescopes in the world are the pair of
Keck telescopes in Hawaii, USA, with reflectors 10 metres in diameter.



Figure 9.30 Schematic diagram of a reflecting telescope (Cassegrain). 










Summary 

1. Reflection is governed by the equation $\angle i = \angle r'$ and refraction by Snell’s law, sin i/sin r =
n, where the incident ray, reflected ray, refracted ray and normal lie in the same plane. Angles
of incidence, reflection and refraction are i, r' and r, respectively.

2. The critical angle of incidence $i_c$ for a ray incident from a denser to rarer medium, is that
angle for which the angle of refraction is 90°. For $i > i_c$, total internal reflection occurs.
Multiple internal reflections in diamond ($i_c$ = 24.4°), totally reflecting prisms and mirage, are
some examples of total internal reflection. Optical fibres consist of glass fibres coated with a
thin layer of material of lower refractive index. Light incident at an angle at one end comes
out at the other, after multiple internal reflections, even if the fibre is bent.

3. Cartesian sign convention: Distances measured in the same direction as the incident light 
are positive; those measured in the opposite direction are negative. All distances are 
measured from the pole/optic centre of the mirror/lens on the principal axis. The heights 
measured upwards above x-axis and normal to the principal axis of the mirror/lens are taken 
as positive. The heights measured downwards are taken as negative. 

4. Mirror equation:

$$\frac{1}{v} + \frac{1}{u} = \frac{1}{f}$$

where u and v are object and image distances, respectively and f is the focal length of the
mirror. f is (approximately) half the radius of curvature R. f is negative for concave mirror; f is
positive for a convex mirror.


5. For a prism of the angle A, of refractive index $n_2$ placed in a medium of refractive index $n_1$,

$$n_{21} = \frac{n_2}{n_1} = \frac{\sin[(A + D_m)/2]}{\sin(A/2)}$$

where $D_m$ is the angle of minimum deviation.


6. For refraction through a spherical interface (from medium 1 to 2 of refractive index $n_1$ and
$n_2$, respectively)

$$\frac{n_2}{v} - \frac{n_1}{u} = \frac{n_2 - n_1}{R}$$

Thin lens formula

$$\frac{1}{v} - \frac{1}{u} = \frac{1}{f}$$

Lens maker’s formula

$$\frac{1}{f} = \frac{(n_2 - n_1)}{n_1}\left(\frac{1}{R_1} - \frac{1}{R_2}\right)$$

$R_1$ and $R_2$ are the radii of curvature of the lens surfaces. f is positive for a converging lens; f
is negative for a diverging lens. The power of a lens P = 1/f.

The SI unit for power of a lens is dioptre (D): 1 D = 1 m$^{-1}$.

If several thin lenses of focal length $f_1, f_2, f_3, \ldots$ are in contact, the effective focal length of their
combination is given by

$$\frac{1}{f} = \frac{1}{f_1} + \frac{1}{f_2} + \frac{1}{f_3} + \cdots$$

The total power of a combination of several lenses is

$$P = P_1 + P_2 + P_3 + \cdots$$


7. Dispersion is the splitting of light into its constituent colours. 

8. Magnifying power m of a simple microscope is given by m = 1 + (D/f), where D = 25 cm is
the least distance of distinct vision and f is the focal length of the convex lens. If the image is
at infinity, m = D/f. For a compound microscope, the magnifying power is given by
$m = m_e \times m_o$ where $m_e = 1 + (D/f_e)$ is the magnification due to the eyepiece and $m_o$ is the
magnification produced by the objective. Approximately,

$$m = \frac{L}{f_o} \times \frac{D}{f_e}$$

where $f_o$ and $f_e$ are the focal lengths of the objective and eyepiece, respectively, and L is the
distance between their focal points.

9. Magnifying power m of a telescope is the ratio of the angle $\beta$ subtended at the eye by the
image to the angle $\alpha$ subtended at the eye by the object.

$$m = \frac{\beta}{\alpha} = \frac{f_o}{f_e}$$

where $f_o$ and $f_e$ are the focal lengths of the objective and eyepiece, respectively.


Points to Ponder 

1. The laws of reflection and refraction are true for all surfaces and pairs of media at the point 
of the incidence. 

2. The real image of an object placed between f and 2f from a convex lens can be seen on a 
screen placed at the image location. If the screen is removed, is the image still there? This 
question puzzles many, because it is difficult to reconcile ourselves with an image suspended 
in air without a screen. But the image does exist. Rays from a given point on the object are 
converging to an image point in space and diverging away. The screen simply diffuses these 
rays, some of which reach our eye and we see the image. This can be seen by the images 
formed in air during a laser show. 

3. Image formation needs regular reflection/refraction. In principle, all rays from a given point 
should reach the same image point. This is why you do not see your image by an irregular 
reflecting object, say the page of a book. 




4. Thick lenses give coloured images due to dispersion. The variety in colour of objects we 
see around us is due to the constituent colours of the light incident on them. A 
monochromatic light may produce an entirely different perception about the colours on an 
object as seen in white light. 

5. For a simple microscope, the angular size of the object equals the angular size of the
image. Yet it offers magnification because we can keep the small object much closer to the
eye than 25 cm and hence have it subtend a large angle. The image is at 25 cm which we
can see. Without the microscope, you would need to keep the small object at 25 cm which
would subtend a very small angle.

Exercises

9.1 A small candle, 2.5 cm in size is placed at 27 cm in front of a 
concave mirror of radius of curvature 36 cm. At what distance from 
the mirror should a screen be placed in order to obtain a sharp 
image? Describe the nature and size of the image. If the candle is 
moved closer to the mirror, how would the screen have to be moved? 

9.2 A 4.5 cm needle is placed 12 cm away from a convex mirror of 
focal length 15 cm. Give the location of the image and the 
magnification. Describe what happens as the needle is moved farther 
from the mirror. 

9.3 A tank is filled with water to a height of 12.5 cm. The apparent 
depth of a needle lying at the bottom of the tank is measured by a 
microscope to be 9.4 cm. What is the refractive index of water? If 
water is replaced by a liquid of refractive index 1.63 up to the same 
height, by what distance would the microscope have to be moved to 
focus on the needle again? 

9.4 Figures 9.31(a) and (b) show refraction of a ray in air incident at
60° with the normal to a glass-air and water-air interface,
respectively. Predict the angle of refraction in glass when the angle of
incidence in water is 45° with the normal to a water-glass interface
[Fig. 9.31(c)].





Figure 9.31 


9.5 A small bulb is placed at the bottom of a tank containing water to 
a depth of 80 cm. What is the area of the surface of water through
which light from the bulb can emerge out? Refractive index of water 
is 1.33. (Consider the bulb to be a point source.) 

9.6 A prism is made of glass of unknown refractive index. A parallel 
beam of light is incident on a face of the prism. The angle of minimum 
deviation is measured to be 40°. What is the refractive index of the 
material of the prism? The refracting angle of the prism is 60°. If the 
prism is placed in water (refractive index 1.33), predict the new angle 
of minimum deviation of a parallel beam of light. 

9.7 Double-convex lenses are to be manufactured from a glass of 
refractive index 1.55, with both faces of the same radius of curvature. 
What is the radius of curvature required if the focal length is to be 
20 cm?


9.8 A beam of light converges at a point P. Now a lens is placed in the
path of the convergent beam 12 cm from P. At what point does the
beam converge if the lens is (a) a convex lens of focal length 20 cm,
and (b) a concave lens of focal length 16 cm?

9.9 An object of size 3.0 cm is placed 14 cm in front of a concave lens
of focal length 21 cm. Describe the image produced by the lens. What
happens if the object is moved further away from the lens?

9.10 What is the focal length of a convex lens of focal length 30 cm in
contact with a concave lens of focal length 20 cm? Is the system a
converging or a diverging lens? Ignore thickness of the lenses.

9.11 A compound microscope consists of an objective lens of focal
length 2.0 cm and an eyepiece of focal length 6.25 cm separated by a
distance of 15 cm. How far from the objective should an object be
placed in order to obtain the final image at (a) the least distance of
distinct vision (25 cm), and (b) at infinity? What is the magnifying
power of the microscope in each case?

9.12 A person with a normal near point (25 cm) using a compound
microscope with objective of focal length 8.0 mm and an eyepiece of
focal length 2.5 cm can bring an object placed at 9.0 mm from the
objective in sharp focus. What is the separation between the two
lenses? Calculate the magnifying power of the microscope.

9.13 A small telescope has an objective lens of focal length 144 cm
and an eyepiece of focal length 6.0 cm. What is the magnifying power
of the telescope? What is the separation between the objective and
the eyepiece?

9.14 (a) A giant refracting telescope at an observatory has an
objective lens of focal length 15 m. If an eyepiece of focal length
1.0 cm is used, what is the angular magnification of the telescope?

(b) If this telescope is used to view the moon, what is the diameter of
the image of the moon formed by the objective lens? The diameter of
the moon is 3.48 × 10$^6$ m, and the radius of lunar orbit is 3.8 × 10$^8$ m.

9.15 Use the mirror equation to deduce that: 

(a) an object placed between f and 2f of a concave mirror produces a 
real image beyond 2f. 

(b) a convex mirror always produces a virtual image independent of 
the location of the object. 

(c) the virtual image produced by a convex mirror is always
diminished in size and is located between the focus and
the pole.

(d) an object placed between the pole and focus of a concave mirror 
produces a virtual and enlarged image. 

[Note: This exercise helps you deduce algebraically properties of 
images that one obtains from explicit ray diagrams.] 

9.16 A small pin fixed on a table top is viewed from above from a
distance of 50 cm. By what distance would the pin appear to be raised
if it is viewed from the same point through a 15 cm thick glass slab
held parallel to the table? Refractive index of glass = 1.5. Does the
answer depend on the location of the slab?

9.17 (a) Figure 9.32 shows a cross-section of a ‘light pipe’ made of a
glass fibre of refractive index 1.68. The outer covering of the pipe is
made of a material of refractive index 1.44. What is the range of the
angles of the incident rays with the axis of the pipe for which total
reflections inside the pipe take place, as shown in the figure?

(b) What is the answer if there is no outer covering of the pipe?

Figure 9.32

9.18 Answer the following questions: 

(a) You have learnt that plane and convex mirrors produce virtual 
images of objects. Can they produce real images under some 
circumstances? Explain. 

(b) A virtual image, we always say, cannot be caught on a screen. Yet 
when we ‘see’ a virtual image, we are obviously bringing it on to the 
‘screen’ (i.e., the retina) of our eye. Is there a contradiction? 

(c) A diver under water, looks obliquely at a fisherman standing on 
the bank of a lake. Would the fisherman look taller or shorter to the 
diver than what he actually is? 

(d) Does the apparent depth of a tank of water change if viewed 
obliquely? If so, does the apparent depth increase or decrease? 

(e) The refractive index of diamond is much greater than that of 
ordinary glass. Is this fact of some use to a diamond cutter? 

9.19 The image of a small electric bulb fixed on the wall of a room is 
to be obtained on the opposite wall 3 m away by means of a large
convex lens. What is the maximum possible focal length of the lens 
required for the purpose? 

9.20 A screen is placed 90 cm from an object. The image of the object
on the screen is formed by a convex lens at two different locations
separated by 20 cm. Determine the focal length of the lens.

9.21 (a) Determine the ‘effective focal length’ of the combination of
the two lenses in Exercise 9.10, if they are placed 8.0 cm apart with
their principal axes coincident. Does the answer depend on which
side of the combination a beam of parallel light is incident? Is the
notion of effective focal length of this system useful at all?

(b) An object 1.5 cm in size is placed on the side of the convex lens
in the arrangement (a) above. The distance between the object and
the convex lens is 40 cm. Determine the magnification produced by
the two-lens system, and the size of the image.

9.22 At what angle should a ray of light be incident on the face of a 
prism of refracting angle 60° so that it just suffers total internal 
reflection at the other face? The refractive index of the material of the 
prism is 1.524. 

9.23 A card sheet divided into squares each of size 1 mm$^2$ is being
viewed at a distance of 9 cm through a magnifying glass (a 
converging lens of focal length 9 cm) held close to the eye. 

(a) What is the magnification produced by the lens? How much is the 
area of each square in the virtual image? 

(b) What is the angular magnification (magnifying power) of the lens? 

(c) Is the magnification in (a) equal to the magnifying power in (b)? 
Explain. 

9.24 (a) At what distance should the lens be held from the figure in
Exercise 9.23 in order to view the squares distinctly with the
maximum possible magnifying power?

(b) What is the magnification in this case? 





(c) Is the magnification equal to the magnifying power in this case? 
Explain. 

9.25 What should be the distance between the object in Exercise
9.24 and the magnifying glass if the virtual image of each square in
the figure is to have an area of 6.25 mm$^2$? Would you be able to see
the squares distinctly with your eyes very close to the magnifier?

[Note: Exercises 9.23 to 9.25 will help you clearly understand the 
difference between magnification in absolute size and the angular 
magnification (or magnifying power) of an instrument.] 

9.26 Answer the following questions: 

(a) The angle subtended at the eye by an object is equal to the angle 
subtended at the eye by the virtual image produced by a magnifying 
glass. In what sense then does a magnifying glass provide angular 
magnification? 

(b) In viewing through a magnifying glass, one usually positions one’s 
eyes very close to the lens. Does angular magnification change if the 
eye is moved back? 

(c) Magnifying power of a simple microscope is inversely proportional 
to the focal length of the lens. What then stops us from using a 
convex lens of smaller and smaller focal length and achieving greater 
and greater magnifying power? 

(d) Why must both the objective and the eyepiece of a compound 
microscope have short focal lengths? 

(e) When viewing through a compound microscope, our eyes should 
be positioned not on the eyepiece but a short distance away from it 
for best viewing. Why? How much should be that short distance 
between the eye and eyepiece? 





9.27 An angular magnification (magnifying power) of 30X is desired
using an objective of focal length 1.25 cm and an eyepiece of focal
length 5 cm. How will you set up the compound microscope?

9.28 A small telescope has an objective lens of focal length 140 cm
and an eyepiece of focal length 5.0 cm. What is the magnifying power
of the telescope for viewing distant objects when

(a) the telescope is in normal adjustment (i.e., when the final image is
at infinity)?

(b) the final image is formed at the least distance of distinct vision
(25 cm)?

9.29 (a) For the telescope described in Exercise 9.28 (a), what is the 
separation between the objective lens and the eyepiece? 

(b) If this telescope is used to view a 100 m tall tower 3 km away, 
what is the height of the image of the tower formed by the objective 
lens? 

(c) What is the height of the final image of the tower if it is formed at
25 cm?

9.30 A Cassegrain telescope uses two mirrors as shown in Fig. 9.30. 
Such a telescope is built with the mirrors 20mm apart. If the radius of 
curvature of the large mirror is 220mm and the small mirror is 
140mm, where will the final image of an object at infinity be? 

9.31 Light incident normally on a plane mirror attached to a
galvanometer coil retraces backwards as shown in Fig. 9.33. A
current in the coil produces a deflection of 3.5° of the mirror. What is
the displacement of the reflected spot of light on a screen placed 1.5
m away?

Figure 9.33

9.32 Figure 9.34 shows an equiconvex lens (of refractive index 1.50) 
in contact with a liquid layer on top of a plane mirror. A small needle 
with its tip on the principal axis is moved along the axis until its 
inverted image is found at the position of the needle. The distance of 
the needle from the lens is measured to be 45.0 cm. The liquid is
removed and the experiment is repeated. The new distance is
measured to be 30.0 cm. What is the refractive index of the liquid?

Chapter Ten


Wave Optics 


10.1 Introduction 

In 1637 Descartes gave the corpuscular model of light and derived 
Snell’s law. It explained the laws of reflection and refraction of light at an 
interface. The corpuscular model predicted that if the ray of light (on 
refraction) bends towards the normal then the speed of light would be 
greater in the second medium. This corpuscular model of light was 
further developed by Isaac Newton in his famous book entitled OPTICKS 
and because of the tremendous popularity of this book, the corpuscular 
model is very often attributed to Newton. 

In 1678, the Dutch physicist Christiaan Huygens put forward the wave 
theory of light - it is this wave model of light that we will discuss in this 
chapter. As we will see, the wave model could satisfactorily explain the 
phenomena of reflection and refraction; however, it predicted that on 
refraction if the wave bends towards the normal then the speed of light 
would be less in the second medium. This is in contradiction to the 
prediction made by using the corpuscular model of light. It was much 
later confirmed by experiments where it was shown that the speed of light 
in water is less than the speed in air confirming the prediction of the wave 
model; Foucault carried out this experiment in 1850. 

The wave theory was not readily accepted primarily because of Newton’s 
authority and also because light could travel through vacuum and it was 
felt that a wave would always require a medium to propagate from one 
point to the other. However, when Thomas Young performed his famous
interference experiment in 1801, it was firmly established that light is
indeed a wave phenomenon. The wavelength of visible light was 
measured and found to be extremely small; for example, the wavelength 
of yellow light is about 0.6 μm. Because of the smallness of the
wavelength of visible light (in comparison to the dimensions of typical 
mirrors and lenses), light can be assumed to approximately travel in 
straight lines. This is the field of geometrical optics, which we had 
discussed in the previous chapter. Indeed, the branch of optics in which 
one completely neglects the finiteness of the wavelength is called 
geometrical optics and a ray is defined as the path of energy propagation 
in the limit of wavelength tending to zero. 

After the interference experiment of Young in 1801, for the next 40 years 
or so, many experiments were carried out involving the interference and 
diffraction of light waves; these experiments could only be satisfactorily
explained by assuming a wave model of light. Thus, around the middle of 
the nineteenth century, the wave theory seemed to be very well 
established. The only major difficulty was this: since it was thought that a
wave required a medium for its propagation, how could light waves
propagate through vacuum? This was explained when Maxwell put
forward his famous electromagnetic theory of light. Maxwell had 
developed a set of equations describing the laws of electricity and 
magnetism and using these equations he derived what is known as the 
wave equation from which he predicted the existence of electromagnetic 
waves*. From the wave equation, Maxwell could calculate the speed of 
electromagnetic waves in free space and he found that the theoretical 
value was very close to the measured value of speed of light. From this, 
he propounded that light must be an electromagnetic wave. Thus, 
according to Maxwell, light waves are associated with changing electric 
and magnetic fields; a changing electric field produces a time and space
varying magnetic field and a changing magnetic field produces a time
and space varying electric field. The changing electric and magnetic 
fields result in the propagation of electromagnetic waves (or light waves) 
even in vacuum. 

In this chapter we will first discuss the original formulation of the Huygens 
principle and derive the laws of reflection and refraction. In Sections 10.4 
and 10.5, we will discuss the phenomenon of interference which is based 
on the principle of superposition. In Section 10.6 we will discuss the 
phenomenon of diffraction which is based on Huygens-Fresnel principle. 
Finally in Section 10.7 we will discuss the phenomenon of polarisation 
which is based on the fact that the light waves are transverse 
electromagnetic waves. 

* Maxwell had predicted the existence of electromagnetic waves around
1865; it was much later (around 1890) that Heinrich Hertz produced
radio waves in the laboratory. J.C. Bose and G. Marconi made practical
applications of the Hertzian waves.


Does light travel in a straight line? 

Light travels in a straight line in Class VI; it does not do so in Class XII and beyond! 
Surprised, aren’t you? 

In school, you are shown an experiment in which you take three cardboards with pinholes in 
them, place a candle on one side and look from the other side. If the flame of the candle and 
the three pinholes are in a straight line, you can see the candle. Even if one of them is 
displaced a little, you cannot see the candle. This proves, so your teacher says, that light 
travels in a straight line. 


In the present book, there are two consecutive chapters, one on ray optics and the other on
wave optics. Ray optics is based on rectilinear propagation of light, and deals with mirrors,
lenses, reflection, refraction, etc. Then you come to the chapter on wave optics, and you are
told that light travels as a wave, that it can bend around objects, it can diffract and interfere,
etc.

In the optical region, light has a wavelength of about half a micrometre. If it encounters an
obstacle of about this size, it can bend around it and can be seen on the other side. Thus a 
micrometre size obstacle will not be able to stop a light ray. If the obstacle is much larger, 
however, light will not be able to bend to that extent, and will not be seen on the other side. 

This is a property of a wave in general, and can be seen in sound waves too. The sound 
wave of our speech has a wavelength of about 50 cm to 1 m. If it meets an obstacle of the
size of a few metres, it bends around it and reaches points behind the obstacle. But when it 
comes across a larger obstacle of a few hundred metres, such as a hillock, most of it is 
reflected and is heard as an echo. 

Then what about the primary school experiment? What happens there is that when we move 
any cardboard, the displacement is of the order of a few millimetres, which is much larger 
than the wavelength of light. Hence the candle cannot be seen. If we are able to move one of 
the cardboards by a micrometre or less, light will be able to diffract, and the candle will still be
seen. 

One could add to the first sentence in this box: It learns how to bend as it grows up!


10.2 Huygens Principle 

We would first define a wavefront: when we drop a small stone on a calm 
pool of water, waves spread out from the point of impact. Every point on 
the surface starts oscillating with time. At any instant, a photograph of the 
surface would show circular rings on which the disturbance is maximum. 
Clearly, all points on such a circle are oscillating in phase because they 
are at the same distance from the source. Such a locus of points, which
oscillate in phase, is called a wavefront; thus a wavefront is defined as a
surface of constant phase. The speed with which the wavefront moves
outwards from the source is called the speed of the wave. The energy of 
the wave travels in a direction perpendicular to the wavefront. 



Figure 10.1 (a) A diverging spherical wave emanating from a point source. The wavefronts are
spherical.

If we have a point source emitting waves uniformly in all directions, then 
the locus of points which have the same amplitude and vibrate in the 
same phase are spheres and we have what is known as a spherical 
wave as shown in Fig. 10.1(a). At a large distance from the source, a 
small portion of the sphere can be considered as a plane and we have 
what is known as a plane wave [Fig. 10.1(b)]. 



Figure 10.1 (b) At a large distance from the source, a small portion of the spherical wave can be 
approximated by a plane wave. 

Now, if we know the shape of the wavefront at t = 0, then Huygens
principle allows us to determine the shape of the wavefront at a later time
τ. Thus, Huygens principle is essentially a geometrical construction,
which, given the shape of the wavefront at any time, allows us to determine
the shape of the wavefront at a later time. Let us consider a diverging
wave and let F₁F₂ represent a portion of the spherical wavefront at t = 0
(Fig. 10.2). Now, according to Huygens principle, each point of the
wavefront is the source of a secondary disturbance and the wavelets 
emanating from these points spread out in all directions with the speed of 
the wave. These wavelets emanating from the wavefront are usually 
referred to as secondary wavelets and if we draw a common tangent to 
all these spheres, we obtain the new position of the wavefront at a later 
time. 

Thus, if we wish to determine the shape of the wavefront at t = τ, we draw
spheres of radius vτ from each point on the spherical wavefront, where v
represents the speed of the waves in the medium. If we now draw a
common tangent to all these spheres, we obtain the new position of the
wavefront at t = τ. The new wavefront, shown as G₁G₂ in Fig. 10.2, is
again spherical with point O as the centre.
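The construction can be checked numerically. The following is only an illustrative sketch: the function name `envelope_radius`, the number of sampled secondary sources and the sample values of r, v and τ are assumptions made here, not part of the text. It places secondary sources on a circle of radius r about O (a plane section of the spherical wavefront), gives each a wavelet of radius vτ, and finds how far from O the wavelets reach in several directions.

```python
import math

def envelope_radius(r, v, tau, n_sources=720, n_dirs=12):
    """Distance from O to the outer envelope of the secondary wavelets, per direction.

    Secondary sources sit on a circle of radius r about O; each one emits
    a wavelet of radius v * tau (hypothetical helper, for illustration only).
    """
    rho = v * tau
    radii = []
    for k in range(n_dirs):
        theta = 2 * math.pi * k / n_dirs            # direction in which we look outward from O
        best = 0.0
        for j in range(n_sources):
            alpha = 2 * math.pi * j / n_sources     # angular position of one secondary source
            along = r * math.cos(alpha - theta)     # source coordinate along the chosen direction
            across = r * math.sin(alpha - theta)    # source coordinate across it
            if abs(across) <= rho:
                # farthest point of this wavelet on the line from O along theta
                reach = along + math.sqrt(rho**2 - across**2)
                best = max(best, reach)
        radii.append(best)
    return radii

radii = envelope_radius(r=2.0, v=3.0, tau=0.5)
print(radii)   # every entry should agree with r + v*tau to within rounding
```

In every direction the envelope lies at the same distance r + vτ from O, which is the statement that the new wavefront is again spherical with O as the centre.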

Figure 10.2 F₁F₂ represents the spherical wavefront (with O as centre) at t = 0. The envelope of
the secondary wavelets emanating from F₁F₂ produces the forward moving wavefront G₁G₂. The
backwave D₁D₂ does not exist.

Figure 10.3 Huygens geometrical construction for a plane wave propagating to the right. F₁F₂ is
the plane wavefront at t = 0 and G₁G₂ is the wavefront at a later time τ. The lines A₁A₂, B₁B₂ ...
etc., are normal to both F₁F₂ and G₁G₂ and represent rays.


The above model has one shortcoming: we also have a backwave which
is shown as D₁D₂ in Fig. 10.2. Huygens argued that the amplitude of the
secondary wavelets is maximum in the forward direction and zero in the
backward direction; by making this ad hoc assumption, Huygens could
explain the absence of the backwave. However, this ad hoc assumption is
not satisfactory and the absence of the backwave is really justified from
more rigorous wave theory.

In a similar manner, we can use Huygens principle to determine the
shape of the wavefront for a plane wave propagating through a medium
(Fig. 10.3).

10.3 Refraction and Reflection of Plane Waves using 
Huygens Principle 

10.3.1 Refraction of a plane wave 

We will now use Huygens principle to derive the laws of refraction. Let
PP' represent the surface separating medium 1 and medium 2, as shown
in Fig. 10.4. Let v₁ and v₂ represent the speed of light in medium 1 and
medium 2, respectively. We assume a plane wavefront AB propagating in
the direction A'A incident on the interface at an angle i as shown in the
figure. Let t be the time taken by the wavefront to travel the distance BC.
Thus,

$$BC = v_1 t$$


Figure 10.4 A plane wave AB is incident at an angle i on the surface PP' separating medium 1
and medium 2. The plane wave undergoes refraction and CE represents the refracted wavefront.
The figure corresponds to v₂ < v₁ so that the refracted waves bend towards the normal.



Christiaan Huygens (1629 - 1695) 


Dutch physicist, astronomer, mathematician and the founder of the 
wave theory of light. His book, Treatise on light, makes fascinating 
reading even today. He brilliantly explained the double refraction 
shown by the mineral calcite in this work in addition to reflection and 
refraction. He was the first to analyse circular and simple harmonic 
motion and designed and built improved clocks and telescopes. He 
discovered the true geometry of Saturn’s rings. 


In order to determine the shape of the refracted wavefront, we draw a
sphere of radius v₂t from the point A in the second medium (the speed of
the wave in the second medium is v₂). Let CE represent a tangent plane
drawn from the point C on to the sphere. Then, AE = v₂t and CE would
represent the refracted wavefront. If we now consider the triangles ABC
and AEC, we readily obtain

$$\sin i = \frac{BC}{AC} = \frac{v_1 t}{AC} \tag{10.1}$$

and

$$\sin r = \frac{AE}{AC} = \frac{v_2 t}{AC} \tag{10.2}$$

where i and r are the angles of incidence and refraction, respectively.
Thus we obtain

$$\frac{\sin i}{\sin r} = \frac{v_1}{v_2} \tag{10.3}$$

From the above equation, we get the important result that if r < i (i.e., if
the ray bends toward the normal), the speed of the light wave in the
second medium (v₂) will be less than the speed of the light wave in the
first medium (v₁). This prediction is opposite to the prediction from the
corpuscular model of light and, as later experiments showed, the
prediction of the wave theory is correct. Now, if c represents the speed of
light in vacuum, then,

$$n_1 = \frac{c}{v_1} \tag{10.4}$$

and

$$n_2 = \frac{c}{v_2} \tag{10.5}$$

are known as the refractive indices of medium 1 and medium 2, 
respectively. In terms of the refractive indices, Eq. (10.3) can be written 
as 



$$n_1 \sin i = n_2 \sin r \tag{10.6}$$

This is Snell’s law of refraction. Further, if λ₁ and λ₂ denote the
wavelengths of light in medium 1 and medium 2, respectively, and if the
distance BC is equal to λ₁, then the distance AE will be equal to λ₂
(because if the crest from B has reached C in time t, then the crest from
A should have also reached E in time t); thus,

$$\frac{\lambda_1}{\lambda_2} = \frac{BC}{AE} = \frac{v_1}{v_2}$$

or

$$\frac{v_1}{\lambda_1} = \frac{v_2}{\lambda_2} \tag{10.7}$$

The above equation implies that when a wave gets refracted into a
denser medium (v₁ > v₂) the wavelength and the speed of propagation
decrease but the frequency ν (= v/λ) remains the same.
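As a rough numerical illustration of Eqs. (10.3) and (10.7), the sketch below computes the angle of refraction and the wavelength in the second medium from the two wave speeds. The function names and the sample values (air treated as vacuum, a refractive index of 1.33 for water, a 45° angle of incidence and a wavelength of 589.0 nm) are assumptions chosen for illustration, not values taken from this section.

```python
import math

def refract(i_deg, v1, v2):
    """Angle of refraction in degrees from sin i / sin r = v1 / v2.

    Returns None when sin r would exceed 1, i.e. when there is no refracted wave.
    """
    s = math.sin(math.radians(i_deg)) * v2 / v1
    if s > 1.0:
        return None
    return math.degrees(math.asin(s))

def wavelength_in_medium(lambda1, v1, v2):
    """Wavelength in medium 2 from v1 / lambda1 = v2 / lambda2 (frequency unchanged)."""
    return lambda1 * v2 / v1

c = 3.0e8              # speed of light in vacuum in m/s (rounded)
v_air = c              # assumption: air treated as vacuum
v_water = c / 1.33     # assumption: refractive index of water taken as 1.33

r = refract(45.0, v_air, v_water)
lam_water = wavelength_in_medium(589.0e-9, v_air, v_water)
print(r, lam_water)
```

With these inputs the refracted ray bends towards the normal (r < i) and the wavelength shrinks by the same factor as the speed, so the ratio v/λ is the same in both media.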

10.3.2 Refraction at a rarer medium 

We now consider refraction of a plane wave at a rarer medium, i.e., v₂ >
v₁. Proceeding in an exactly similar manner we can construct a refracted
wavefront as shown in Fig. 10.5. The angle of refraction will now be
greater than the angle of incidence; however, we will still have n₁ sin i = n₂
sin r. We define an angle i_c by the following equation

$$\sin i_c = \frac{n_2}{n_1} \tag{10.8}$$

Thus, if i = i_c then sin r = 1 and r = 90°. Obviously, for i > i_c, there cannot
be any refracted wave. The angle i_c is known as the critical angle and for
all angles of incidence greater than the critical angle, we will not have any
refracted wave and the wave will undergo what is known as total internal
reflection. The phenomenon of total internal reflection and its applications
were discussed in Section 9.4.
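A short sketch of Eq. (10.8) follows. The function name and the refractive indices used (1.50 for glass, 1.00 for air) are illustrative assumptions.

```python
import math

def critical_angle_deg(n1, n2):
    """Critical angle in degrees from sin i_c = n2 / n1.

    Defined only when the wave goes from a denser to a rarer medium (n1 > n2).
    """
    if n2 >= n1:
        raise ValueError("a critical angle exists only when n2 < n1")
    return math.degrees(math.asin(n2 / n1))

i_c = critical_angle_deg(1.50, 1.00)   # assumed indices: glass 1.50, air 1.00
print(i_c)
```

For angles of incidence larger than the value returned, sin r would have to exceed 1, which is why no refracted wave exists.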



Figure 10.5 Refraction of a plane wave incident on a rarer medium for which v₂ > v₁. The plane
wave bends away from the normal.

10.3.3 Reflection of a plane wave by a plane surface 

We next consider a plane wave AB incident at an angle i on a reflecting
surface MN. If v represents the speed of the wave in the medium and if t
represents the time taken by the wavefront to advance from the point B to
C then the distance

$$BC = vt$$


In order to construct the reflected wavefront we draw a sphere of radius 
vt from the point A as shown in Fig. 10.6. Let CE represent the tangent 
plane drawn from the point C to this sphere. Obviously 



$$AE = BC = vt$$



Figure 10.6 Reflection of a plane wave AB by the reflecting surface MN.
AB and CE represent incident and reflected wavefronts.

If we now consider the triangles EAC and BAC we will find that they are 
congruent and therefore, the angles i and r (as shown in Fig. 10.6) would
be equal. This is the law of reflection. 

Once we have the laws of reflection and refraction, the behaviour of 
prisms, lenses, and mirrors can be understood. These phenomena were 
discussed in detail in Chapter 9 on the basis of rectilinear propagation of 
light. Here we just describe the behaviour of the wavefronts as they 
undergo reflection or refraction. In Fig. 10.7(a) we consider a plane wave 
passing through a thin prism. Clearly, since the speed of light waves is 
less in glass, the lower portion of the incoming wavefront (which travels 
through the greatest thickness of glass) will get delayed resulting in a tilt 
in the emerging wavefront as shown in the figure. In Fig. 10.7(b) we 
consider a plane wave incident on a thin convex lens; the central part of 
the incident plane wave traverses the thickest portion of the lens and is 
delayed the most. The emerging wavefront has a depression at the 
centre and therefore the wavefront becomes spherical and converges to 
the point F which is known as the focus. In Fig. 10.7(c) a plane wave is 
incident on a concave mirror and on reflection we have a spherical wave 
converging to the focal point F. In a similar manner, we can understand
refraction and reflection by concave lenses and convex mirrors.

Figure 10.7 Refraction of a plane wave by (a) a thin prism, (b) a convex lens. (c) Reflection of
a plane wave by a concave mirror.

From the above discussion it follows that the total time taken from a point
on the object to the corresponding point on the image is the same
measured along any ray. For example, when a convex lens focusses light
to form a real image, the ray going through the centre traverses
a shorter path but, because of the slower speed in glass, the time taken
is the same as for rays travelling near the edge of the lens.

10.3.4 The Doppler effect

We should mention here that one should be careful in constructing the 
wavefronts if the source (or the observer) is moving. For example, if there 
is no medium and the source moves away from the observer, then later 
wavefronts have to travel a greater distance to reach the observer and 
hence take a longer time. The time taken between the arrival of two 
successive wavefronts is hence longer at the observer than it is at the 
source. Thus, when the source moves away from the observer the
frequency as measured by the observer will be smaller. This is known as
the Doppler effect. Astronomers call the increase in wavelength due to the
Doppler effect red shift, since a wavelength in the middle of the visible
region of the spectrum moves towards the red end of the spectrum.
When waves are received from a source moving towards the observer,
there is an apparent decrease in wavelength; this is referred to as blue
shift.

You have already encountered the Doppler effect for sound waves in Chapter
15 of the Class XI textbook. For velocities small compared to the speed of
light, we can use the same formulae which we use for sound waves. The
fractional change in frequency Δν/ν is given by −v_radial/c, where v_radial is
the component of the source velocity along the line joining the observer
to the source relative to the observer; v_radial is considered positive when
the source moves away from the observer. Thus, the Doppler shift can be
expressed as:

$$\frac{\Delta\nu}{\nu} = -\frac{v_{\text{radial}}}{c} \tag{10.9}$$

The formula given above is valid only when the speed of the source is 
small compared to that of light. A more accurate formula for the Doppler 
effect which is valid even when the speeds are close to that of light, 
requires the use of Einstein’s special theory of relativity. The Doppler 
effect for light is very important in astronomy. It is the basis for the 
measurements of the radial velocities of distant galaxies. 


Example 10.1 With what speed should a galaxy move with respect to us so that the sodium line at
589.0 nm is observed at 589.6 nm?

Solution Since νλ = c, Δν/ν = −Δλ/λ (for small changes in ν and λ). For

Δλ = 589.6 − 589.0 = +0.6 nm

we get [using Eq. (10.9)]

$$\frac{\Delta\nu}{\nu} = -\frac{\Delta\lambda}{\lambda} = -\frac{v_{\text{radial}}}{c}$$

or,

$$v_{\text{radial}} \cong +c\left(\frac{0.6}{589.0}\right) = +3.06 \times 10^{5}\ \text{m s}^{-1} = 306\ \text{km/s}$$

Therefore, the galaxy is moving away from us.
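The arithmetic of this example can be reproduced in a few lines. This is a minimal sketch under the low-speed approximation of Eq. (10.9); the function name `radial_velocity` and the rounded value of c are assumptions made for illustration.

```python
def radial_velocity(lambda_emitted_nm, lambda_observed_nm, c=3.0e8):
    """Radial velocity in m/s from delta_lambda / lambda = v_radial / c.

    Valid only for speeds small compared to c; a positive result means
    the source is moving away from the observer.
    """
    return c * (lambda_observed_nm - lambda_emitted_nm) / lambda_emitted_nm

v = radial_velocity(589.0, 589.6)
print(v / 1e3, "km/s")
```

The positive sign of the result corresponds to a red shift, i.e. to a source receding from the observer.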

Example 10.2 

(a) When monochromatic light is incident on a surface separating two media, the reflected 
and refracted light both have the same frequency as the incident frequency. Explain why? 

(b) When light travels from a rarer to a denser medium, the speed decreases. Does the 
reduction in speed imply a reduction in the energy carried by the light wave? 

(c) In the wave picture of light, intensity of light is determined by the square of the amplitude
of the wave. What determines the intensity of light in the photon picture of light?

Solution 

(a) Reflection and refraction arise through interaction of incident light with the atomic
constituents of matter. Atoms may be viewed as oscillators, which take up the frequency
of the external agency (light) causing forced oscillations. The frequency of light emitted
by a charged oscillator equals its frequency of oscillation. Thus, the frequency of
scattered light equals the frequency of incident light.

(b) No. Energy carried by a wave depends on the amplitude of the wave, not on the speed of 
wave propagation. 

(c) For a given frequency, intensity of light in the photon picture is determined by the number
of photons crossing a unit area per unit time.






10.4 Coherent and Incoherent Addition of Waves 

In this section we will discuss the interference pattern produced by the 
superposition of two waves. You may recall that we had discussed the 
superposition principle in Chapter 15 of your Class XI textbook. Indeed 
the entire field of interference is based on the superposition principle 
according to which at a particular point in the medium, the resultant 
displacement produced by a number of waves is the vector sum of the 
displacements produced by each of the waves. 





Figure 10.8 (a) Two needles oscillating in phase in water represent two coherent sources. (b) The
pattern of displacement of water molecules at an instant on the surface of water showing nodal N 
(no displacement) and antinodal A (maximum displacement) lines. 


Consider two needles S₁ and S₂ moving periodically up and down in an
identical fashion in a trough of water [Fig. 10.8(a)]. They produce two
water waves, and at a particular point, the phase difference between the
displacements produced by each of the waves does not change with
time; when this happens the two sources are said to be coherent. Figure
10.8(b) shows the position of crests (solid circles) and troughs (dashed
circles) at a given instant of time. Consider a point P for which

$$S_1P = S_2P$$

Since the distances S₁P and S₂P are equal, waves from S₁ and S₂ will
take the same time to travel to the point P and waves that emanate from
S₁ and S₂ in phase will also arrive, at the point P, in phase.

Thus, if the displacement produced by the source S₁ at the point P is
given by

$$y_1 = a\cos\omega t$$

then, the displacement produced by the source S₂ (at the point P) will
also be given by

$$y_2 = a\cos\omega t$$

Thus, the resultant displacement at P would be given by

$$y = y_1 + y_2 = 2a\cos\omega t$$



Since the intensity is proportional to the square of the amplitude, the
resultant intensity will be given by

$$I = 4I_0$$

where I₀ represents the intensity produced by each one of the individual
sources; I₀ is proportional to a². In fact at any point on the perpendicular
bisector of S₁S₂, the intensity will be 4I₀. The two sources are said to
interfere constructively and we have what is referred to as constructive
interference. We next consider a point Q [Fig. 10.9(a)] for which

$$S_2Q - S_1Q = 2\lambda$$

The waves emanating from S₁ will arrive exactly two cycles earlier than
the waves from S₂ and will again be in phase [Fig. 10.9(a)]. Thus, if the
displacement produced by S₁ is given by

$$y_1 = a\cos\omega t$$

then the displacement produced by S₂ will be given by

$$y_2 = a\cos(\omega t - 4\pi) = a\cos\omega t$$

where we have used the fact that a path difference of 2λ corresponds to
a phase difference of 4π. The two displacements are once again in
phase and the intensity will again be 4I₀ giving rise to constructive
interference. In the above analysis we have assumed that the distances
S₁Q and S₂Q are much greater than d (which represents the distance
between S₁ and S₂) so that although S₁Q and S₂Q are not equal, the
amplitudes of the displacement produced by each wave are very nearly
the same.



Figure 10.9 (a) Constructive interference at a point Q for which the path difference is 2λ. (b) Destructive
interference at a point R for which the path difference is 2.5λ.


We next consider a point R [Fig. 10.9(b)] for which

$$S_2R - S_1R = -2.5\lambda$$

The waves emanating from S₁ will arrive exactly two and a half cycles
later than the waves from S₂ [Fig. 10.9(b)]. Thus if the displacement
produced by S₁ is given by

$$y_1 = a\cos\omega t$$

then the displacement produced by S₂ will be given by

$$y_2 = a\cos(\omega t + 5\pi) = -a\cos\omega t$$

where we have used the fact that a path difference of 2.5λ corresponds
to a phase difference of 5π. The two displacements are now out of phase
and the two displacements will cancel out to give zero intensity. This is
referred to as destructive interference.

Figure 10.10 Locus of points for which S₁P − S₂P is equal to zero, ±λ, ±2λ, ±3λ.

To summarise: If we have two coherent sources S₁ and S₂ vibrating in
phase, then for an arbitrary point P whenever the path difference,

$$S_1P \sim S_2P = n\lambda \quad (n = 0, 1, 2, 3, \ldots) \tag{10.10}$$

we will have constructive interference and the resultant intensity will be
4I₀; the sign ~ between S₁P and S₂P represents the difference between
S₁P and S₂P. On the other hand, if the point P is such that the path
difference,

$$S_1P \sim S_2P = \left(n + \frac{1}{2}\right)\lambda \quad (n = 0, 1, 2, 3, \ldots) \tag{10.11}$$

we will have destructive interference and the resultant intensity will be
zero. Now, for any other arbitrary point G (Fig. 10.10) let the phase
difference between the two displacements be φ. Thus, if the
displacement produced by S₁ is given by

$$y_1 = a\cos\omega t$$

then, the displacement produced by S₂ would be

$$y_2 = a\cos(\omega t + \phi)$$

and the resultant displacement will be given by

$$y = y_1 + y_2 = a[\cos\omega t + \cos(\omega t + \phi)] = 2a\cos(\phi/2)\cos(\omega t + \phi/2)$$



The amplitude of the resultant displacement is 2a cos (φ/2) and therefore
the intensity at that point will be

$$I = 4I_0\cos^2(\phi/2) \tag{10.12}$$

If φ = 0, ±2π, ±4π, ... which corresponds to the condition given by Eq.
(10.10) we will have constructive interference leading to maximum
intensity. On the other hand, if φ = ±π, ±3π, ±5π, ... [which corresponds
to the condition given by Eq. (10.11)] we will have destructive
interference leading to zero intensity.
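The dependence of intensity on path difference can be tabulated with a short sketch. It assumes two in-phase sources of equal intensity I₀ and converts a path difference Δ into a phase difference using φ = 2πΔ/λ (the relation used above, where a path difference of 2λ corresponds to 4π). The function name and default value of I₀ are illustrative assumptions.

```python
import math

def resultant_intensity(path_difference, wavelength, i0=1.0):
    """Resultant intensity I = 4 * I0 * cos^2(phi / 2) for two coherent sources.

    phi is obtained from the path difference as 2 * pi * path_difference / wavelength.
    """
    phi = 2 * math.pi * path_difference / wavelength
    return 4 * i0 * math.cos(phi / 2) ** 2

lam = 1.0   # wavelength in arbitrary units (assumed value)
for delta in (0.0, 0.25 * lam, 0.5 * lam, 2.0 * lam, 2.5 * lam):
    print(delta, resultant_intensity(delta, lam))
```

Path differences of 0 and 2λ give the maximum value 4I₀, while 0.5λ and 2.5λ give essentially zero, in line with Eqs. (10.10) and (10.11).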

Now if the two sources are coherent (i.e., if the two needles are going up
and down regularly) then the phase difference φ at any point will not
change with time and we will have a stable interference pattern; i.e., the
positions of maxima and minima will not change with time. However, if
the two needles do not maintain a constant phase difference, then the
interference pattern will also change with time and, if the phase
difference changes very rapidly with time, the positions of maxima and
minima will also vary rapidly with time and we will see a “time-averaged”
intensity distribution. When this happens, we will observe an average
intensity that will be given by

$$\langle I \rangle = 4I_0 \langle \cos^2(\phi/2) \rangle \tag{10.13}$$

where angular brackets represent time averaging. Indeed it is shown in
Section 7.2 that if φ(t) varies randomly with time, the time-averaged
quantity ⟨cos²(φ/2)⟩ will be 1/2. This is also intuitively obvious because
the function cos²(φ/2) will randomly vary between 0 and 1 and the
average value will be 1/2. The resultant intensity will be given by

$$I = 2I_0 \tag{10.14}$$

at all points.
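The value 1/2 can be made explicit under one simple assumption that is not stated in the text: that over the averaging time the phase difference φ is equally likely to take any value between 0 and 2π. A sketch of the average in that case:

```latex
\langle \cos^2(\phi/2) \rangle
  = \frac{1}{2\pi}\int_0^{2\pi} \cos^2\!\left(\frac{\phi}{2}\right) d\phi
  = \frac{1}{2\pi}\int_0^{2\pi} \frac{1 + \cos\phi}{2}\, d\phi
  = \frac{1}{2},
\qquad\text{so that}\qquad
\langle I \rangle = 4 I_0 \cdot \frac{1}{2} = 2 I_0 .
```

The cos φ term integrates to zero over a full cycle, leaving only the constant 1/2.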

When the phase difference between the two vibrating sources changes 
rapidly with time, we say that the two sources are incoherent and when 
this happens the intensities just add up. This is indeed what happens 
when two separate light sources illuminate a wall. 

10.5 Interference of Light Waves and Young’s Experiment 

We will now discuss interference using light waves. If we use two sodium
lamps illuminating two pinholes (Fig. 10.11) we will not observe any
interference fringes. This is because the light wave
emitted from an ordinary source (like a sodium lamp) undergoes abrupt
phase changes in times of the order of 10⁻¹⁰ seconds. Thus the light
waves coming out from two independent sources of light will not have
any fixed phase relationship and would be incoherent; when this
happens, as discussed in the previous section, the intensities on the
screen will add up.


The British physicist Thomas Young used an ingenious technique to
“lock” the phases of the waves emanating from S₁ and S₂. He made two
pinholes S₁ and S₂ (very close to each other) on an opaque screen [Fig.
10.12(a)]. These were illuminated by another pinhole S that was, in turn, lit
by a bright source. Light waves spread out from S and fall on both S₁ and
S₂. S₁ and S₂ then behave like two coherent sources because light
waves coming out from S₁ and S₂ are derived from the same original
source and any abrupt phase change in S will manifest in exactly similar
phase changes in the light coming out from S₁ and S₂. Thus, the two
sources S₁ and S₂ will be locked in phase; i.e., they will be coherent like
the two vibrating needles in our water wave example [Fig. 10.8(a)].


Figure 10.11 If two sodium lamps illuminate two pinholes S₁ and S₂, the intensities will add up
and no interference fringes will be observed on the screen.

Figure 10.12 Young’s arrangement to produce interference pattern.

Thus spherical waves emanating from S 1 and S 2 will produce 

interference fringes on the screen GG', as shown in Fig. 10.12(b). The 
positions of maximum and minimum intensities can be calculated by 
using the analysis given in Section 10.4 where we had shown that for an 
arbitrary point P on the line GG' [Fig. 10.12(b)] to correspond to a 
maximum, we must have 

S 2 P - S-,P = n\] n = 0, 1, 2 ... (10.15) 

Now,

$$(S_2P)^2 - (S_1P)^2 = \left[D^2 + \left(x + \frac{d}{2}\right)^2\right] - \left[D^2 + \left(x - \frac{d}{2}\right)^2\right] = 2xd$$

where $S_1S_2 = d$ and $OP = x$. Thus

$$S_2P - S_1P = \frac{2xd}{S_2P + S_1P} \qquad (10.16)$$


If $x, d \ll D$ then negligible error will be introduced if
$S_2P + S_1P$ (in the denominator) is replaced by $2D$. For example, for
d = 0.1 cm, D = 100 cm, OP = 1 cm (which correspond to typical values for
an interference experiment using light waves), we have

$$S_2P + S_1P = [(100)^2 + (1.05)^2]^{1/2} + [(100)^2 + (0.95)^2]^{1/2} \approx 200.01 \text{ cm}$$

Thus if we replace $S_2P + S_1P$ by $2D$, the error involved is about 0.005%.
In this approximation, Eq. (10.16) becomes


Thomas Young (1773 - 1829) English physicist, physician and
Egyptologist. Young worked on a wide variety of scientific problems,
ranging from the structure of the eye and the mechanism of vision to
the decipherment of the Rosetta stone. He revived the wave theory of
light and recognised that interference phenomena provide proof of the
wave properties of light.


$$S_2P - S_1P \approx \frac{xd}{D} \qquad (10.17)$$
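The size of the error made in passing from Eq. (10.16) to Eq. (10.17) can be checked numerically. The following is a minimal sketch, assuming the slits sit at $\pm d/2$ from the axis and using the values quoted in the text (d = 0.1 cm, D = 100 cm, x = 1 cm); the function names are illustrative and not part of the text.

```python
import math

def path_difference_exact(x, d, D):
    """Exact S2P - S1P for slits at -d/2 and +d/2, screen at distance D."""
    s2p = math.hypot(D, x + d / 2)  # distance from S2 to P
    s1p = math.hypot(D, x - d / 2)  # distance from S1 to P
    return s2p - s1p

def path_difference_approx(x, d, D):
    """Approximate path difference xd/D, valid when x, d << D."""
    return x * d / D

# Values used in the text, all in centimetres.
x, d, D = 1.0, 0.1, 100.0
exact = path_difference_exact(x, d, D)
approx = path_difference_approx(x, d, D)
relative_error = abs(approx - exact) / exact
print(f"exact = {exact:.8f} cm, approx = {approx:.8f} cm")
print(f"relative error = {100 * relative_error:.4f} %")
```

The relative error comes out close to the 0.005% quoted above, which is why $2D$ can safely replace $S_2P + S_1P$ for typical laboratory dimensions.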

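Hence we will have constructive interference resulting in a bright region
when $\frac{xd}{D} = n\lambda$ [Eq. (10.15)]. That is,

$$x = x_n = \frac{n\lambda D}{d}; \quad n = 0, \pm 1, \pm 2, \ldots \qquad (10.18)$$

On the other hand, we will have destructive interference resulting in a
dark region when $\frac{xd}{D} = \left(n + \frac{1}{2}\right)\lambda$, that is

$$x = x_n = \left(n + \frac{1}{2}\right)\frac{\lambda D}{d}; \quad n = 0, \pm 1, \pm 2, \ldots \qquad (10.19)$$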

Thus dark and bright bands appear on the screen, as shown in Fig.
10.13. Such bands are called fringes. Equations (10.18) and (10.19)
show that dark and bright fringes are equally spaced and the distance
between two consecutive bright (or two consecutive dark) fringes is
given by

$$\beta = x_{n+1} - x_n \quad \text{or} \quad \beta = \frac{\lambda D}{d} \qquad (10.20)$$

which is the expression for the fringe width. Obviously, the central point O
(in Fig. 10.12) will be bright because $S_1O = S_2O$ and it will correspond to
n = 0 [Eq. (10.18)]. If we consider the line perpendicular to the plane of
the paper and passing through O [i.e., along the y-axis], then all points on
this line will be equidistant from $S_1$ and $S_2$ and we will have a bright
central fringe which is a straight line, as shown in Fig. 10.13. In order to
determine the shape of the interference pattern on the screen we note
that a particular fringe would correspond to the locus of points with a
constant value of $S_2P - S_1P$. Whenever this constant is an integral
multiple of $\lambda$, the fringe will be bright and whenever it is an odd
integral multiple of $\lambda/2$ it will be a dark fringe. Now, the locus of
the point P lying in the x-y plane such that $S_2P - S_1P$ (= $\Delta$) is a
constant, is a hyperbola. Thus the fringe pattern will strictly be a
hyperbola; however, if the distance D is very large compared to the fringe
width, the fringes will be very nearly straight lines as shown in Fig. 10.13.

Figure 10.13 Computer generated fringe pattern produced by two point sources $S_1$ and $S_2$ on the
screen GG' (Fig. 10.12); (a) and (b) correspond to d = 0.005 mm and 0.025 mm, respectively
(both figures correspond to D = 5 cm and $\lambda = 5 \times 10^{-5}$ cm.) (Adopted from OPTICS by A. Ghatak,
Tata McGraw Hill Publishing Co. Ltd., New Delhi, 2000.)
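Equations (10.18) to (10.20) translate directly into a short calculation. The sketch below is illustrative only: the function names are assumptions, and the numerical values are the ones quoted in the caption of Fig. 10.13 (D = 5 cm, $\lambda = 5 \times 10^{-5}$ cm, d = 0.005 mm and 0.025 mm), with all lengths converted to centimetres.

```python
def bright_fringe_positions(wavelength, D, d, orders):
    """Positions x_n = n * lambda * D / d of the bright fringes, Eq. (10.18)."""
    return [n * wavelength * D / d for n in orders]

def dark_fringe_positions(wavelength, D, d, orders):
    """Positions x_n = (n + 1/2) * lambda * D / d of the dark fringes, Eq. (10.19)."""
    return [(n + 0.5) * wavelength * D / d for n in orders]

def fringe_width(wavelength, D, d):
    """Fringe width beta = lambda * D / d, Eq. (10.20)."""
    return wavelength * D / d

wavelength = 5e-5  # cm
D = 5.0            # cm
for d in (5e-4, 2.5e-3):  # 0.005 mm and 0.025 mm expressed in cm
    beta = fringe_width(wavelength, D, d)
    print(f"d = {d} cm: fringe width = {beta:.3f} cm")
    print("  bright fringes at", bright_fringe_positions(wavelength, D, d, range(-2, 3)))
    print("  dark fringes at  ", dark_fringe_positions(wavelength, D, d, range(-2, 2)))
```

Increasing the slit separation five-fold reduces the fringe width five-fold, in line with the narrower fringes of the second panel.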








































In the double-slit experiment shown in Fig. 10.12(b), we have taken the source hole S on the
perpendicular bisector of the two slits, which is shown as the line SO. What happens if the
source S is slightly away from the perpendicular bisector? Consider that the source is moved
to some new point S' and suppose that Q is the mid-point of $S_1$ and $S_2$. If the angle S'QS is
$\phi$, then the central bright fringe occurs at an angle $-\phi$, on the other side. Thus, if the source S
is on the perpendicular bisector, then the central fringe occurs at O, also on the perpendicular
bisector. If S is shifted by an angle $\phi$ to point S', then the central fringe appears at a point O'
at an angle $-\phi$, which means that it is shifted by the same angle on the other side of the
bisector. This also means that the source S', the mid-point Q and the point O' of the central
fringe are in a straight line.

We end this section by quoting from the Nobel lecture of Dennis Gabor*

The wave nature of light was demonstrated convincingly for the first time 
in 1801 by Thomas Young by a wonderfully simple experiment. He let a 
ray of sunlight into a dark room, placed a dark screen in front of it, 
pierced with two small pinholes, and beyond this, at some distance, a 
white screen. He then saw two darkish lines at both sides of a bright line, 
which gave him sufficient encouragement to repeat the experiment, this 
time with spirit flame as light source, with a little salt in it to produce the 
bright yellow sodium light. This time he saw a number of dark lines,
regularly spaced; the first clear proof that light added to light can produce
darkness. This phenomenon is called interference. Thomas Young had
expected it because he believed in the wave theory of light.

*Dennis Gabor received the 1971 Nobel Prize in Physics for discovering the principles of
holography.

We should mention here that the fringes are straight lines although $S_1$
and $S_2$ are point sources. If we had slits instead of the point sources
(Fig. 10.14), each pair of points would have produced straight line fringes
resulting in straight line fringes with increased intensities.

Figure 10.14 Photograph and the graph of the intensity distribution (plotted against path
difference) in Young’s double-slit experiment.


Example 10.3 Two slits are made one millimetre apart and the screen is placed one metre
away. What is the fringe separation when blue-green light of wavelength 500 nm is used?

Solution Fringe spacing $= \dfrac{D\lambda}{d} = \dfrac{1 \times 5 \times 10^{-7}}{1 \times 10^{-3}}$ m

$= 5 \times 10^{-4}$ m = 0.5 mm

Example 10.4 What is the effect on the interference fringes in a Young’s double-slit
experiment due to each of the following operations:

(a) the screen is moved away from the plane of the slits; 

(b) the (monochromatic) source is replaced by another (monochromatic) source of shorter 
wavelength; 

(c) the separation between the two slits is increased; 

(d) the source slit is moved closer to the double-slit plane; 

(e) the width of the source slit is increased; 

(f) the monochromatic source is replaced by a source of white 
light? 

(In each operation, take all parameters, other than the one specified, to remain unchanged.) 

Solution 

(a) Angular separation of the fringes remains constant (= $\lambda/d$). The actual separation of the
fringes increases in proportion to the distance of the screen from the plane of the two slits.

(b) The separation of the fringes (and also angular separation) decreases. See, however, the
condition mentioned in (d) below.

(c) The separation of the fringes (and also angular separation) decreases. See, however, the
condition mentioned in (d) below.

(d) Let s be the size of the source and S its distance from the plane of the two slits. For
interference fringes to be seen, the condition $s/S < \lambda/d$ should be satisfied; otherwise,
interference patterns produced by different parts of the source overlap and no fringes are
seen. Thus, as S decreases (i.e., the source slit is brought closer), the interference pattern
gets less and less sharp, and when the source is brought too close for this condition to be
valid, the fringes disappear. Till this happens, the fringe separation remains fixed.

(e) Same as in (d). As the source slit width increases, fringe pattern gets less and less sharp.
When the source slit is so wide that the condition $s/S < \lambda/d$ is not satisfied, the interference
pattern disappears.

(f) The interference patterns due to different component colours of white light overlap
(incoherently). The central bright fringes for different colours are at the same position.
Therefore, the central fringe is white. For a point P for which $S_2P - S_1P = \lambda_b/2$, where
$\lambda_b$ (≈ 4000 Å) represents the wavelength for the blue colour, the blue component will be absent
and the fringe will appear red in colour. Slightly farther away where $S_2Q - S_1Q = \lambda_b = \lambda_r/2$,
where $\lambda_r$ (≈ 8000 Å) is the wavelength for the red colour, the fringe will be predominantly blue.

Thus, the fringe closest on either side of the central white fringe is red and the farthest will
appear blue. After a few fringes, no clear fringe pattern is seen.
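The visibility condition used in parts (d) and (e), $s/S < \lambda/d$, is easy to test numerically. The sketch below is only an illustration: the function names and the sample values (a 0.1 mm source slit, 500 nm light, slits 1 mm apart) are assumptions chosen for the example and are not taken from the text.

```python
def fringes_visible(source_size, source_distance, wavelength, slit_separation):
    """Return True when s/S < lambda/d, the condition for fringes to be seen."""
    return source_size / source_distance < wavelength / slit_separation

def minimum_source_distance(source_size, wavelength, slit_separation):
    """Distance S at which s/S equals lambda/d; fringes need S larger than this."""
    return source_size * slit_separation / wavelength

# Hypothetical values, all in metres.
s = 0.1e-3            # width of the source slit
wavelength = 500e-9   # wavelength of the light
d = 1e-3              # separation of the two slits

print(fringes_visible(s, 0.5, wavelength, d))   # source 50 cm away
print(fringes_visible(s, 0.1, wavelength, d))   # source brought to 10 cm
print(minimum_source_distance(s, wavelength, d))
```

With these numbers the fringes are seen at 50 cm but not at 10 cm, and the crossover lies near 20 cm, which mirrors the behaviour described in (d).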


10.6 Diffraction 

If we look closely at the shadow cast by an opaque object, close to the
region of geometrical shadow, there are alternate dark and bright regions
just like in interference. This happens due to the phenomenon of
diffraction. Diffraction is a general characteristic exhibited by all types of
waves, be it sound waves, light waves, water waves or matter waves.
Since the wavelength of light is much smaller than the dimensions of
most obstacles, we do not encounter diffraction effects of light in
everyday observations. However, the finite resolution of our eye or of
optical instruments such as telescopes or microscopes is due to
the phenomenon of diffraction. Indeed, the colours that you see when a
CD is viewed are due to diffraction effects. We will now discuss the
phenomenon of diffraction.

10.6.1 The single slit 

In the discussion of Young’s experiment, we stated that a single narrow 
slit acts as a new source from which light spreads out. Even before 
Young, early experimenters - including Newton - had noticed that light 
spreads out from narrow holes and slits. It seems to turn around corners 
and enter regions where we would expect a shadow. These effects, 
known as diffraction, can only be properly understood using wave ideas. 
After all, you are hardly surprised to hear sound waves from someone 
talking around a corner! 

When the double slit in Young’s experiment is replaced by a single 
narrow slit (illuminated by a monochromatic source), a broad pattern with 
a central bright region is seen. On both sides, there are alternate dark 
and bright regions, the intensity becoming weaker away from the centre 
(Fig. 10.16). To understand this, go to Fig. 10.15, which shows a parallel 
beam of light falling normally on a single slit LN of width a. The diffracted 
light goes on to meet a screen. The midpoint of the slit is M. 

A straight line through M perpendicular to the slit plane meets the screen
at C. We want the intensity at any point P on the screen. As before,
straight lines joining P to the different points L, M, N, etc., can be treated
as parallel, making an angle $\theta$ with the normal MC.

The basic idea is to divide the slit into much smaller parts, and add their 
contributions at P with the proper phase differences. We are treating 
different parts of the wavefront at the slit as secondary sources. Because 
the incoming wavefront is parallel to the plane of the slit, these sources 
are in phase. 

The path difference NP - LP between the two edges of the slit can be
calculated exactly as for Young’s experiment. From Fig. 10.15,

$$NP - LP = NQ = a \sin\theta \approx a\theta \quad \text{(for smaller angles)} \qquad (10.21)$$

Similarly, if two points $M_1$ and $M_2$ in the slit plane are separated by y, the
path difference $M_2P - M_1P \approx y\theta$. We now have to sum up equal, coherent
contributions from a large number of sources, each with a different
phase. This calculation was made by Fresnel using integral calculus, so
we omit it here. The main features of the diffraction pattern can be
understood by simple arguments.

At the central point C on the screen, the angle $\theta$ is zero. All path
differences are zero and hence all the parts of the slit contribute in phase.
This gives maximum intensity at C. Experimental observation shown in
Fig. 10.16 indicates that the intensity has a central maximum at $\theta = 0$ and
other secondary maxima at $\theta \approx (n + 1/2)\lambda/a$, and has minima (zero
intensity) at $\theta \approx n\lambda/a$, $n = \pm 1, \pm 2, \pm 3, \ldots$ It is easy to see why it
has minima at these values of angle. Consider first the angle $\theta$ where the
path difference $a\theta$ is $\lambda$. Then,

$$\theta \approx \lambda/a \qquad (10.22)$$



Now, divide the slit into two equal halves LM and MN each of size a/2.
For every point $M_1$ in LM, there is a point $M_2$ in MN such that $M_1M_2 = a/2$.
The path difference between $M_1$ and $M_2$ at P is $M_2P - M_1P = \theta a/2 = \lambda/2$
for the angle chosen. This means that the contributions from $M_1$ and $M_2$
are 180° out of phase and cancel in the direction $\theta = \lambda/a$. Contributions
from the two halves of the slit LM and MN, therefore, cancel each other.
Equation (10.22) gives the angle at which the intensity falls to zero. One
can similarly show that the intensity is zero for $\theta = n\lambda/a$, with n being any
integer (except zero!). Notice that the angular size of the central
maximum increases when the slit width a decreases.
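The pairing argument above can be checked by brute force: treat the slit as a row of equal secondary sources, give each one the phase that follows from its path difference $y\theta$, and add the contributions. This is a rough numerical sketch rather than Fresnel's calculation; the function name, the number of sources and the sample values (a = 0.2 mm, $\lambda$ = 500 nm) are assumptions made for illustration.

```python
import cmath
import math

def slit_amplitude(theta, slit_width, wavelength, n_sources=1000):
    """Normalised resultant amplitude at small angle theta from n_sources
    equal, in-phase secondary sources spread uniformly across the slit."""
    k = 2 * math.pi / wavelength
    total = 0j
    for i in range(n_sources):
        y = (i + 0.5) * slit_width / n_sources  # distance of the source from edge L
        total += cmath.exp(1j * k * y * theta)  # phase from path difference y * theta
    return abs(total) / n_sources               # equals 1 at theta = 0

a = 0.2e-3            # slit width in metres (hypothetical)
wavelength = 500e-9   # wavelength in metres (hypothetical)

for multiple in (0, 0.5, 1, 1.5, 2):
    theta = multiple * wavelength / a
    amplitude = slit_amplitude(theta, a, wavelength)
    print(f"theta = {multiple} lambda/a: relative intensity = {amplitude ** 2:.4f}")
```

The intensity vanishes (to rounding error) at $\theta = \lambda/a$ and $2\lambda/a$, where the contributions cancel in pairs, and is largest at $\theta = 0$, where all sources are in phase.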



Figure 10.15 The geometry of path differences for diffraction by a single slit. 


It is also easy to see why there are maxima at $\theta = (n + 1/2)\lambda/a$ and why
they go on becoming weaker and weaker with increasing n. Consider an
angle $\theta = 3\lambda/2a$ which is midway between two of the dark fringes. Divide
the slit into three equal parts. If we take the first two thirds of the slit, the
path difference between the two ends would be

$$\frac{2}{3}a \times \theta = \frac{2a}{3} \times \frac{3\lambda}{2a} = \lambda \qquad (10.23)$$

The first two-thirds of the slit can therefore be divided into two halves
which have a $\lambda/2$ path difference. The contributions of these two halves
cancel in the same manner as described earlier. Only the remaining
one-third of the slit contributes to the intensity at a point between the two
minima. Clearly, this will be much weaker than the central maximum
(where the entire slit contributes in phase). One can similarly show that
there are maxima at $(n + 1/2)\lambda/a$ with n = 2, 3, etc. These become
weaker with increasing n, since only one-fifth, one-seventh, etc., of the
slit contributes in these cases. The photograph and intensity pattern
corresponding to it is shown in Fig. 10.16.

Figure 10.16 Intensity distribution and photograph of fringes due to diffraction
at single slit.
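How quickly the secondary maxima weaken can be estimated from the standard closed-form result of the integration that the text omits, $I/I_0 = (\sin\beta/\beta)^2$ with $\beta = \pi a\theta/\lambda$. This formula is not derived in the text and is taken here as an assumption; the function name and the sample values are likewise illustrative.

```python
import math

def single_slit_intensity(theta, slit_width, wavelength):
    """Relative intensity (sin(beta)/beta)**2, beta = pi * a * theta / lambda.
    Standard small-angle single-slit result, assumed rather than derived here."""
    beta = math.pi * slit_width * theta / wavelength
    if beta == 0:
        return 1.0  # central maximum: the whole slit contributes in phase
    return (math.sin(beta) / beta) ** 2

a = 0.2e-3            # slit width in metres (hypothetical)
wavelength = 500e-9   # wavelength in metres (hypothetical)

for n in (1, 2, 3):
    theta = (n + 0.5) * wavelength / a
    ratio = single_slit_intensity(theta, a, wavelength)
    print(f"theta = ({n} + 1/2) lambda/a: I/I0 = {ratio:.4f}")
```

Under this assumption the first secondary maximum carries only a few per cent of the central intensity, and each subsequent one is weaker still, consistent with the qualitative argument that a progressively smaller fraction of the slit contributes.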


There has been prolonged discussion about the difference between
interference and diffraction among scientists since the discovery of
these phenomena. In this context, it is interesting to note what Richard
Feynman* has said in his famous Feynman Lectures on Physics:











No one has ever been able to define the difference between interference
and diffraction satisfactorily. It is just a question of usage, and there is no
specific, important physical difference between them. The best we can do,
roughly speaking, is to say that when there are only a few sources,
say two interfering sources, then the result is usually called interference,
but if there is a large number of them, it seems that the word diffraction is
more often used.

In the double-slit experiment, we must note that the pattern on the screen
is actually a superposition of single-slit diffraction from each slit or hole,
and the double-slit interference pattern. This is shown in
Fig. 10.17. It shows a broader diffraction peak in which there appear
several fringes of smaller width due to double-slit interference. The
number of interference fringes occurring in the broad diffraction peak
depends on the ratio d/a, that is, the ratio of the distance between the two
slits to the width of a slit. In the limit of a becoming very small, the
diffraction pattern will become very flat and we will observe the two-slit
interference pattern [see Fig. 10.13(b)].

FIGURE 10.17 The actual double-slit interference pattern. The envelope shows the single slit
diffraction.
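The combined pattern can be sketched as the two-slit fringes multiplied by the single-slit envelope. The code below assumes the standard small-angle forms $\cos^2(\pi d\theta/\lambda)$ for the fringes and $(\sin\beta/\beta)^2$, $\beta = \pi a\theta/\lambda$, for the envelope; neither expression is written out in the text, and the function names and sample values (d = 0.6 mm, a = 0.1 mm) are hypothetical.

```python
import math

def double_slit_intensity(theta, d, a, wavelength):
    """Relative intensity: cos^2 two-slit fringes times the single-slit envelope."""
    alpha = math.pi * d * theta / wavelength
    beta = math.pi * a * theta / wavelength
    envelope = 1.0 if beta == 0 else (math.sin(beta) / beta) ** 2
    return math.cos(alpha) ** 2 * envelope

def fringe_widths_in_central_peak(d, a):
    """Angular width of the central diffraction peak (2*lambda/a) divided by
    the angular fringe width (lambda/d); the wavelength cancels."""
    return 2 * d / a

d = 0.6e-3            # slit separation in metres (hypothetical)
a = 0.1e-3            # slit width in metres (hypothetical)
wavelength = 500e-9   # wavelength in metres (hypothetical)

print(fringe_widths_in_central_peak(d, a))
for n in range(0, 7):
    theta = n * wavelength / d  # direction of the n-th interference maximum
    print(n, round(double_slit_intensity(theta, d, a, wavelength), 4))
```

The printed intensities fall steadily from the centre and reach zero at the order where the interference maximum coincides with the first diffraction null, which is the behaviour of the envelope in Fig. 10.17.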


Example 10.5 In Example 10.3, what should the width of each slit be to obtain 10 maxima of
the double slit pattern within the central maximum of the single slit pattern?

Solution We want $a\theta = \lambda$, i.e., $\theta = \dfrac{\lambda}{a}$, and

$$10\,\frac{\lambda}{d} = 2\,\frac{\lambda}{a}, \quad \text{so that} \quad a = \frac{d}{5} = 0.2 \text{ mm}$$

Notice that the wavelength of light and distance of the screen do not
enter in the calculation of a.


In the double-slit interference experiment of Fig. 10.12, what happens if
we close one slit? You will see that it now amounts to a single slit. But
you will have to take care of some shift in the pattern. We now have a
source at S, and only one hole (or slit) $S_1$ or $S_2$. This will produce a
single-slit diffraction pattern on the screen. The centre of the central
bright fringe will appear at a point which lies on the straight line $SS_1$ or
$SS_2$, as the case may be.

We now compare and contrast the interference pattern with that seen for 
a coherently illuminated single slit (usually called the single slit diffraction 
pattern). 

(i) The interference pattern has a number of equally spaced bright and 
dark bands. The diffraction pattern has a central bright maximum which is 
twice as wide as the other maxima. The intensity falls as we go to 
successive maxima away from the centre, on either side. 

(ii) We calculate the interference pattern by superposing two waves 
originating from the two narrow slits. The diffraction pattern is a 
superposition of a continuous family of waves originating from each point 
on a single slit. 

(iii) For a single slit of width a, the first null of the diffraction pattern
occurs at an angle of $\lambda/a$. At the same angle of $\lambda/a$, we get a maximum
(not a null) for two narrow slits separated by a distance a.

* Richard Feynman was one of the recipients of the 1965 Nobel Prize in Physics for his
fundamental work in quantum electrodynamics.

One must understand that both d and a have to be quite small, to be able
to observe good interference and diffraction patterns. For example, the
separation d between the two slits must be of the order of a millimetre or
so. The width a of each slit must be even smaller, of the order of 0.1 or
0.2 mm.

In our discussion of Young’s experiment and the single-slit diffraction, we 
have assumed that the screen on which the fringes are formed is at a 
large distance. The two or more paths from the slits to the screen were 
treated as parallel. This situation also occurs when we place a 
converging lens after the slits and place the screen at the focus. Parallel 
paths from the slit are combined at a single point on the screen. Note that 
the lens does not introduce any extra path differences in a parallel beam. 
This arrangement is often used since it gives more intensity than placing 
the screen far away. If f is the focal length of the lens, then we can easily
work out the size of the central bright maximum. In terms of angles, the
separation of the central maximum from the first null of the diffraction
pattern is $\lambda/a$. Hence, the size on the screen will be $f\lambda/a$.

10.6.2 Seeing the single slit diffraction pattern 

It is surprisingly easy to see the single-slit diffraction pattern for oneself. 
The equipment needed can be found in most homes — two razor blades 
and one clear glass electric bulb preferably with a straight filament. One 
has to hold the two blades so that the edges are parallel and have a 
narrow slit in between. This is easily done with the thumb and forefingers
(Fig. 10.18).

Keep the slit parallel to the filament, right in front of the eye. Use 
spectacles if you normally do. With slight adjustment of the width of the 
slit and the parallelism of the edges, the pattern should be seen with its 
bright and dark bands. Since the position of all the bands (except the 
central one) depends on wavelength, they will show some colours. Using 
a filter for red or blue will make the fringes clearer. With both filters 
available, the wider fringes for red compared to blue can be seen. 

In this experiment, the filament plays the role of the first slit S in Fig.
10.16. The lens of the eye focuses the pattern on the screen (the retina
of the eye).

Figure 10.18 Holding two blades to form a single slit. A bulb filament viewed through this shows
clear diffraction bands.


With some effort, one can cut a double slit in an aluminium foil with a 
blade. The bulb filament can be viewed as before to repeat Young’s 
experiment. In daytime, there is another suitable bright source 
subtending a small angle at the eye. This is the reflection of the Sun in 
any shiny convex surface (e.g., a cycle bell). Do not try direct sunlight - it 
can damage the eye and will not give fringes anyway as the Sun
subtends an angle of (1/2)°.










In interference and diffraction, light energy is redistributed. If it reduces in 
one region, producing a dark fringe, it increases in another region, 
producing a bright fringe. There is no gain or loss of energy, which is 
consistent with the principle of conservation of energy. 


10.6.3 Resolving power of optical instruments 

In Chapter 9 we discussed telescopes. The angular resolution
of the telescope is determined by the objective of the telescope. The 
stars which are not resolved in the image produced by the objective 
cannot be resolved by any further magnification produced by the 
eyepiece. The primary purpose of the eyepiece is to provide 
magnification of the image produced by the objective. 


Consider a parallel beam of light falling on a convex lens. If the lens is 
well corrected for aberrations, then geometrical optics tells us that the 
beam will get focused to a point. However, because of diffraction, the 
beam instead of getting focused to a point gets focused to a spot of finite 
area. In this case the effects due to diffraction can be taken into account 
by considering a plane wave incident on a circular aperture followed by a 
convex lens (Fig. 10.19). The analysis of the corresponding diffraction 
pattern is quite involved; however, in principle, it is similar to the analysis 
carried out to obtain the single-slit diffraction pattern. Taking into account 
the effects due to diffraction, the pattern on the focal plane would consist 
of a central bright region surrounded by concentric dark and bright rings 
(Fig. 10.19). A detailed analysis shows that the radius of the central bright
region is approximately given by

$$r_0 \approx \frac{1.22\,\lambda f}{2a} = \frac{0.61\,\lambda f}{a} \qquad (10.24)$$


where f is the focal length of the lens and 2a is the diameter of the
circular aperture or the diameter of the lens, whichever is smaller.
Typically if

$\lambda \approx 0.5$ µm, $f \approx 20$ cm and $a \approx 5$ cm

we have

$r_0 \approx 1.2$ µm

Although the size of the spot is very small, it plays an important role in
determining the limit of resolution of optical instruments like a telescope
or a microscope. For the two stars to be just resolved

$$f\,\Delta\theta \approx r_0 \approx \frac{0.61\,\lambda f}{a}$$

implying

$$\Delta\theta \approx \frac{0.61\,\lambda}{a} \qquad (10.25)$$

Figure 10.19 A parallel beam of light is incident on a convex lens. Because of diffraction effects,
the beam gets focused to a spot of radius ≈ $0.61\,\lambda f/a$.

Thus $\Delta\theta$ will be small if the diameter of the objective is large. This implies
that the telescope will have better resolving power if a is large. It is for
this reason that for better resolution, a telescope must have a large
diameter objective.
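The spot radius and the angular limit of resolution discussed above are simple enough to evaluate directly. The following sketch assumes the forms $r_0 \approx 0.61\,\lambda f/a$ (the expression quoted in the caption of Fig. 10.19) and $\Delta\theta \approx 0.61\,\lambda/a$; the function names are illustrative, and the input values are the typical ones quoted in the text, converted to metres.

```python
def airy_spot_radius(wavelength, focal_length, aperture_radius):
    """Radius of the central bright spot, r0 = 0.61 * lambda * f / a."""
    return 0.61 * wavelength * focal_length / aperture_radius

def angular_resolution(wavelength, aperture_radius):
    """Smallest resolvable angular separation, 0.61 * lambda / a (radians)."""
    return 0.61 * wavelength / aperture_radius

wavelength = 0.5e-6   # 0.5 micrometre
focal_length = 0.20   # 20 cm
a = 0.05              # aperture radius of 5 cm

r0 = airy_spot_radius(wavelength, focal_length, a)
print(f"spot radius r0 = {r0 * 1e6:.2f} micrometre")
print(f"angular resolution = {angular_resolution(wavelength, a):.2e} rad")

# Doubling the aperture radius halves the smallest resolvable angle.
print(angular_resolution(wavelength, 2 * a) / angular_resolution(wavelength, a))
```

The computed radius is close to the 1.2 µm quoted in the text, and the last line makes explicit why a larger objective gives a finer limit of resolution.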










Example 10.6 Assume that light of wavelength 6000 Å is coming from a star. What is the limit
of resolution of a telescope whose objective has a diameter of 100 inch?

Solution A 100 inch telescope implies that 2a = 100 inch = 254 cm. Thus if

$\lambda \approx 6000$ Å $= 6 \times 10^{-5}$ cm

then

$$\Delta\theta \approx \frac{0.61 \times 6 \times 10^{-5}}{127} \approx 2.9 \times 10^{-7} \text{ radians}$$

We can apply a similar argument to the objective lens of a microscope. In
this case, the object is placed slightly beyond f, so that a real image is
formed at a distance v [Fig. 10.20]. The magnification (ratio of image size
to object size) is given by $m \approx v/f$. It can be seen from Fig. 10.20 that

$$D/f \approx 2 \tan\beta \qquad (10.26)$$

where $2\beta$ is the angle subtended by the diameter of the objective lens at
the focus of the microscope.

FIGURE 10.20 Real image formed by the objective lens of the microscope.


Determine the resolving power of your eye 















You can estimate the resolving power of your eye with a simple experiment. Make black 
stripes of equal width separated by white stripes; see figure here. All the black stripes should 
be of equal width, while the width of the intermediate white stripes should increase as you go 
from the left to the right. For example, let all black stripes have a width of 5 mm. Let the width 
of the first two white stripes be 0.5 mm each, the next two white stripes be 1 mm each, the 
next two 1.5 mm each, etc. Paste this pattern on a wall in a room or laboratory, at the height 
of your eye. 



Now watch the pattern, preferably with one eye. By moving away or 
closer to the wall, find the position where you can just see some two 
black stripes as separate stripes. All the black stripes to the left of this 
stripe would merge into one another and would not be distinguishable. 
On the other hand, the black stripes to the right of this would be more 
and more clearly visible. Note the width d of the white stripe which 
separates the two regions, and measure the distance D of the wall 
from your eye. Then d/D is the resolution of your eye. 

You have watched specks of dust floating in air in a sunbeam entering 
through your window. Find the distance (of a speck) which you can 
clearly see and distinguish from a neighbouring speck. Knowing the 
resolution of your eye and the distance of the speck, estimate the size 
of the speck of dust. 

When the separation between two points in a microscopic specimen is
comparable to the wavelength $\lambda$ of the light, the diffraction effects
become important. The image of a point object will again be a diffraction
pattern whose size in the image plane will be

$$v\theta = v\left(\frac{1.22\,\lambda}{D}\right) \qquad (10.27)$$

Two objects whose images are closer than this distance will not be
resolved; they will be seen as one. The corresponding minimum
separation, $d_{min}$, in the object plane is given by

$$d_{min} = \left[v\left(\frac{1.22\,\lambda}{D}\right)\right]\Big/ m = \frac{1.22\,\lambda}{D}\cdot\frac{v}{m} = \frac{1.22\,f\lambda}{D} \qquad (10.28)$$

Now, combining Eqs. (10.26) and (10.28), we get

$$d_{min} = \frac{1.22\,\lambda}{2\tan\beta} \approx \frac{1.22\,\lambda}{2\sin\beta} \qquad (10.29)$$

If the medium between the object and the objective lens is not air but a
medium of refractive index n, Eq. (10.29) gets modified to

$$d_{min} = \frac{1.22\,\lambda}{2\,n\sin\beta} \qquad (10.30)$$

The product $n \sin\beta$ is called the numerical aperture and is sometimes
marked on the objective.

The resolving power of the microscope is given by the reciprocal of the
minimum separation of two points seen as distinct. It can be seen from
Eq. (10.30) that the resolving power can be increased by choosing a
medium of higher refractive index. Usually an oil having a refractive index
close to that of the objective glass is used. Such an arrangement is called
an ‘oil immersion objective’. Notice that it is not possible to make $\sin\beta$
larger than unity. Thus, we see that the resolving power of a microscope
is basically determined by the wavelength of the light used.
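The effect of an immersion medium on the minimum resolvable separation can be illustrated numerically. The sketch below takes Eq. (10.30) in the standard form $d_{min} = 1.22\,\lambda/(2n\sin\beta)$; the function name and the sample values (550 nm light, a half-angle of 60°, an oil of refractive index 1.5) are assumptions chosen only for illustration.

```python
import math

def min_resolvable_separation(wavelength, half_angle, refractive_index=1.0):
    """d_min = 1.22 * lambda / (2 * n * sin(beta)); half_angle is beta in radians."""
    numerical_aperture = refractive_index * math.sin(half_angle)
    return 1.22 * wavelength / (2 * numerical_aperture)

wavelength = 550e-9        # metres (hypothetical)
beta = math.radians(60.0)  # half-angle subtended by the objective (hypothetical)

d_air = min_resolvable_separation(wavelength, beta)
d_oil = min_resolvable_separation(wavelength, beta, refractive_index=1.5)
print(f"in air: d_min = {d_air * 1e9:.0f} nm")
print(f"in oil: d_min = {d_oil * 1e9:.0f} nm")
```

Since $\sin\beta$ cannot exceed unity, the only remaining handles are the refractive index and the wavelength, and even with oil immersion $d_{min}$ stays of the order of the wavelength.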

There is a likelihood of confusion between resolution and magnification, 
and similarly between the role of a telescope and a microscope to deal 
with these parameters. A telescope produces images of far objects 
nearer to our eye. Therefore objects which are not resolved at far 
distance, can be resolved by looking at them through a telescope. A 
microscope, on the other hand, magnifies objects (which are near to us) 
and produces their larger image. We may be looking at two stars or two 
satellites of a far-away planet, or we may be looking at different regions 
of a living cell. In this context, it is good to remember that a telescope 
resolves whereas a microscope magnifies. 

10.6.4 The validity of ray optics 

An aperture (i.e., slit or hole) of size a illuminated by a parallel beam
sends diffracted light into an angle of approximately $\lambda/a$. This is the
angular size of the bright central maximum. In travelling a distance z, the
diffracted beam therefore acquires a width $z\lambda/a$ due to diffraction. It is
interesting to ask at what value of z the spreading due to diffraction
becomes comparable to the size a of the aperture. We thus
approximately equate $z\lambda/a$ with a. This gives the distance beyond which
divergence of the beam of width a becomes significant. Therefore,

$$z \approx \frac{a^2}{\lambda} \qquad (10.31)$$

We define a quantity $z_F$ called the Fresnel distance by the following
equation

$$z_F \approx \frac{a^2}{\lambda}$$

Equation (10.31) shows that for distances much smaller than $z_F$, the
spreading due to diffraction is small compared to the size of the beam.
It becomes comparable when the distance is approximately $z_F$. For
distances much greater than $z_F$, the spreading due to diffraction
dominates over that due to ray optics (i.e., the size a of the aperture).
Equation (10.31) also shows that ray optics is valid in the limit of
wavelength tending to zero.
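The comparison between diffraction spreading and aperture size can be made concrete with a few lines of code. This is a minimal sketch, assuming the Fresnel distance $z_F = a^2/\lambda$ and the spreading $z\lambda/a$ described above; the function names and the sample values (a 1 mm aperture and 600 nm light) are hypothetical.

```python
def fresnel_distance(aperture_size, wavelength):
    """Fresnel distance z_F = a**2 / lambda."""
    return aperture_size ** 2 / wavelength

def diffraction_spread(distance, aperture_size, wavelength):
    """Width z * lambda / a acquired by the beam after travelling a distance z."""
    return distance * wavelength / aperture_size

a = 1e-3              # aperture size in metres (hypothetical)
wavelength = 600e-9   # wavelength in metres (hypothetical)

z_f = fresnel_distance(a, wavelength)
print(f"Fresnel distance = {z_f:.3f} m")
for z in (0.1 * z_f, z_f, 10 * z_f):
    spread = diffraction_spread(z, a, wavelength)
    print(f"z = {z:.3f} m: spread / aperture = {spread / a:.1f}")
```

The ratio of spread to aperture size is well below one for distances much smaller than $z_F$, equals one at $z_F$, and grows beyond it, which is the sense in which ray optics holds only for $z \ll z_F$.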


Example 10.7 For what distance is ray optics a good approximation when the aperture is 3
mm wide and the wavelength is 500 nm?

Solution

$$z_F = \frac{a^2}{\lambda} = \frac{(3 \times 10^{-3})^2}{5 \times 10^{-7}} = 18 \text{ m}$$

This example shows that even with a small aperture, diffraction spreading can be neglected
for rays many metres in length. Thus, ray optics is valid in many common situations.

10.7 Polarisation

Consider a long string held horizontally, the other end of which is
assumed to be fixed. If we move the end of the string up and down in a
periodic manner, we will generate a wave propagating in the +x
direction (Fig. 10.21). Such a wave could be described by the following
equation

$$y(x, t) = a \sin(kx - \omega t) \qquad (10.32)$$

where a and $\omega$ (= $2\pi\nu$) represent the amplitude and the angular
frequency of the wave, respectively; further,

$$\lambda = \frac{2\pi}{k} \qquad (10.33)$$

represents the wavelength associated with the wave. We had discussed
propagation of such waves in Chapter 15 of Class XI textbook. Since the
displacement (which is along the y direction) is at right angles to the
direction of propagation of the wave, we have what is known as a
transverse wave. Also, since the displacement is in the y direction, it is
often referred to as a y-polarised wave. Since each point on the string
moves on a straight line, the wave is also referred to as a linearly
polarised wave. Further, the string always remains confined to the x-y
plane and therefore it is also referred to as a plane polarised wave.

Figure 10.21 (a) The curves represent the displacement of a string at t = 0 and at t = Δt,
respectively when a sinusoidal wave is propagating in the +x-direction. (b) The curve represents
the time variation of the displacement at x = 0 when a sinusoidal wave is propagating in the
+x-direction. At x = Δx, the time variation of the displacement will be slightly displaced to the right.

In a similar manner we can consider the vibration of the string in the x-z
plane generating a z-polarised wave whose displacement will be given by

$$z(x, t) = a \sin(kx - \omega t) \qquad (10.34)$$

It should be mentioned that the linearly polarised waves [described by
Eqs. (10.32) and (10.34)] are all transverse waves; i.e., the displacement
of each point of the string is always at right angles to the direction of
propagation of the wave. Finally, if the plane of vibration of the string is
changed randomly in very short intervals of time, then we have what is
known as an unpolarised wave. Thus, for an unpolarised wave the
displacement will be randomly changing with time though it will always be
perpendicular to the direction of propagation.

Light waves are transverse in nature; i.e., the electric field associated 
with a propagating light wave is always at right angles to the direction of 
propagation of the wave. This can be easily demonstrated using a simple 
Polaroid. You must have seen thin plastic-like sheets, which are called
polaroids. A polaroid consists of long chain molecules aligned in a
particular direction. The electric vectors (associated with the propagating
light wave) along the direction of the aligned molecules get absorbed.
Thus, if an unpolarised light wave is incident on such a Polaroid then the 
light wave will get linearly polarised with the electric vector oscillating 
along a direction perpendicular to the aligned molecules; this direction is 
known as the pass-axis of the Polaroid. 

Thus, if the light from an ordinary source (like a sodium lamp) passes
through a Polaroid sheet P₁, it is observed that its intensity is reduced by
half. Rotating P₁ has no effect on the transmitted beam and transmitted
intensity remains constant. Now, let an identical piece of Polaroid P₂ be
placed before P₁. As expected, the light from the lamp is reduced in
intensity on passing through P₂ alone. But now rotating P₁ has a dramatic
effect on the light coming from P₂. In one position, the intensity
transmitted by P₂ followed by P₁ is nearly zero. When turned by 90° from
this position, P₁ transmits nearly the full intensity emerging from P₂
(Fig. 10.22).

The above experiment can be easily understood by assuming that light
passing through the Polaroid P₂ gets polarised along the pass-axis of P₂.
If the pass-axis of P₂ makes an angle θ with the pass-axis of P₁, then
when the polarised beam passes through the Polaroid P₁, the component
E cos θ (along the pass-axis of P₁) will pass through P₁. Thus, as we
rotate the Polaroid P₁ (or P₂), the intensity will vary as:

$$I = I_0 \cos^2\theta \tag{10.35}$$

where I₀ is the intensity of the polarised light after passing through P₂.
This is known as Malus’ law. The above discussion shows that the
intensity coming out of a single Polaroid is half of the incident intensity. By
putting a second Polaroid, the intensity can be further controlled from
50% to zero of the incident intensity by adjusting the angle between the
pass-axes of two polaroids.
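
The variation described by Malus’ law can be tabulated with a few lines of Python. This is only a minimal sketch: it assumes ideal polaroids (no absorption along the pass-axis), and the function names and the unit incident intensity are illustrative choices, not part of the text.

```python
import math

def malus_intensity(i0, theta_deg):
    """Intensity passed by an ideal analyser: I = I0 cos^2(theta)."""
    theta = math.radians(theta_deg)
    return i0 * math.cos(theta) ** 2

def through_two_polaroids(i_unpolarised, theta_deg):
    """Unpolarised light through two ideal polaroids whose pass-axes differ by theta."""
    # An ideal first polaroid transmits half of the unpolarised intensity
    i0 = 0.5 * i_unpolarised
    return malus_intensity(i0, theta_deg)

if __name__ == "__main__":
    for angle in (0, 30, 45, 60, 90):
        fraction = through_two_polaroids(1.0, angle)
        print(f"theta = {angle:2d} deg  ->  transmitted fraction = {fraction:.3f}")
```

The output runs from 0.5 at θ = 0° down to essentially zero at θ = 90°, which is the "50% to zero" range mentioned above.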



Figure 10.22 (a) Passage of light through two polaroids P₂ and P₁. The transmitted fraction falls
from 1 to 0 as the angle between them varies from 0° to 90°. Notice that the light seen through a
single Polaroid P₁ does not vary with angle. (b) Behaviour of the electric vector when light passes
through two polaroids. The transmitted polarisation is the component parallel to the Polaroid
axis. The double arrows show the oscillations of the electric vector.


Polaroids can be used to control the intensity, in sunglasses, 
windowpanes, etc. Polaroids are also used in photographic cameras and 
3D movie cameras. 


Example 10.8 Discuss the intensity of transmitted light when a Polaroid sheet is rotated
between two crossed polaroids.


























Solution Let I₀ be the intensity of polarised light after passing through the first polariser P₁.
Then the intensity of light after passing through the second polariser P₂ will be

$$I = I_0 \cos^2\theta$$

where θ is the angle between the pass axes of P₁ and P₂. Since P₁ and P₃ are crossed, the angle
between the pass axes of P₂ and P₃ will be (π/2 − θ). Hence the intensity of light emerging from P₃ will be

$$I = I_0 \cos^2\theta \, \cos^2\!\left(\frac{\pi}{2} - \theta\right) = I_0 \cos^2\theta \, \sin^2\theta = \frac{I_0}{4} \sin^2 2\theta$$

Therefore, the transmitted intensity will be maximum when θ = π/4.
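
The result of Example 10.8 can be checked numerically by applying Malus’ law twice. The sketch below assumes ideal polarisers and a unit intensity after P₁; the function name is an illustrative choice.

```python
import math

def intensity_after_p3(i0, theta_deg):
    """Light of intensity i0 (already polarised by P1) passes P2 and then P3.

    theta_deg is the angle between the pass axes of P1 and P2;
    P3 is crossed with P1, so it makes an angle (90 - theta) with P2.
    """
    theta = math.radians(theta_deg)
    after_p2 = i0 * math.cos(theta) ** 2                       # Malus' law at P2
    after_p3 = after_p2 * math.cos(math.pi / 2 - theta) ** 2   # Malus' law at P3
    return after_p3

# Scan whole-degree settings of the middle polariser
angles = range(0, 91)
best_angle = max(angles, key=lambda deg: intensity_after_p3(1.0, deg))

print(f"maximum transmission at theta = {best_angle} deg")
print(f"transmitted fraction there    = {intensity_after_p3(1.0, best_angle):.3f}")
```

The scan picks out θ = 45°, where a quarter of I₀ gets through, in agreement with the closed form (I₀/4) sin² 2θ.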


10.7.1 Polarisation by scattering 

The light from a clear blue portion of the sky shows a rise and fall of 
intensity when viewed through a Polaroid which is rotated. This is nothing 
but sunlight, which has changed its direction (having been scattered) on 
encountering the molecules of the earth’s atmosphere. As Fig. 10.23(a) 
shows, the incident sunlight is unpolarised. The dots stand for 
polarisation perpendicular to the plane of the figure. The double arrows 
show polarisation in the plane of the figure. (There is no phase relation 
between these two in unpolarised light). Under the influence of the 
electric field of the incident wave the electrons in the molecules acquire 
components of motion in both these directions. We have drawn an 
observer looking at 90° to the direction of the sun. Clearly, charges 
accelerating parallel to the double arrows do not radiate energy towards
this observer since their acceleration has no transverse component. The
radiation scattered by the molecule is therefore represented by dots. It is
polarised perpendicular to the plane of the figure. This explains the
polarisation of scattered light from the sky.

Figure 10.23 (a) Polarisation of the blue scattered light from the sky. The incident sunlight is
unpolarised (dots and arrows). A typical molecule is shown. It scatters light by 90° polarised
normal to the plane of the paper (dots only). (b) Polarisation of light reflected from a transparent
medium at the Brewster angle (reflected ray perpendicular to refracted ray).

The scattering of light by molecules was intensively investigated by C.V. 
Raman and his collaborators in Kolkata in the 1920s. Raman was 
awarded the Nobel Prize for Physics in 1930 for this work. 


A SPECIAL CASE OF TOTAL TRANSMISSION 

When light is incident on an interface of two media, it is observed that some part of it gets 
reflected and some part gets transmitted. Consider a related question: Is it possible that 
under some conditions a monochromatic beam of light incident on a surface (which is 
normally reflective) gets completely transmitted with no reflection? To your surprise, the
answer is yes.

Let us try a simple experiment and check what happens. Arrange a laser, a good polariser, a
prism and screen as shown in the figure here.

Let the light emitted by the laser source pass through the polariser and be incident on the
surface of the prism at the Brewster’s angle of incidence i_B. Now rotate the polariser carefully
and you will observe that for a specific alignment of the polariser, the light incident on the
prism is completely transmitted and no light is reflected from the surface of the prism. The
reflected spot will completely vanish.


10.7.2 Polarisation by reflection 

Figure 10.23(b) shows light reflected from a transparent medium, say, 
water. As before, the dots and arrows indicate that both polarisations are 
present in the incident and refracted waves. We have drawn a situation in 
which the reflected wave travels at right angles to the refracted wave. 
The oscillating electrons in the water produce the reflected wave. These
electrons move in the two directions transverse to the wave in the
medium, i.e., the refracted wave. The arrows are parallel to the direction
of the reflected wave. Motion in this direction does not contribute to the 
reflected wave. As the figure shows, the reflected light is therefore 
linearly polarised perpendicular to the plane of the figure (represented by 
dots). This can be checked by looking at the reflected light through an 
analyser. The transmitted intensity will be zero when the axis of the 
analyser is in the plane of the figure, i.e., the plane of incidence. 






When unpolarised light is incident on the boundary between two
transparent media, the reflected light is polarised with its electric vector
perpendicular to the plane of incidence when the refracted and reflected
rays make a right angle with each other. Thus we have seen that when the
reflected wave is perpendicular to the refracted wave, the reflected wave
is a totally polarised wave. The angle of incidence in this case is called
Brewster’s angle and is denoted by i_B. We can see that i_B is related to the
refractive index of the denser medium. Since we have i_B + r = π/2, we get
from Snell’s law

$$\mu = \frac{\sin i_B}{\sin r} = \frac{\sin i_B}{\sin(\pi/2 - i_B)} = \frac{\sin i_B}{\cos i_B} = \tan i_B \tag{10.36}$$

This is known as Brewster’s law.


Example 10.9 Unpolarised light is incident on a plane glass surface. What should be the 
angle of incidence so that the reflected and refracted rays are perpendicular to each other? 

Solution For i + r to be equal to π/2, we should have tan i_B = μ = 1.5. This gives i_B ≈ 56.3°.

This is the Brewster’s angle for the air to glass interface.
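
Brewster’s law is easy to evaluate numerically. The following sketch computes the Brewster angle for a given refractive index and checks that the reflected and refracted rays are then perpendicular; the function name is an illustrative choice, and the rarer medium is assumed to be air with refractive index 1.

```python
import math

def brewster_angle_deg(mu):
    """Brewster angle (degrees) for light going from air into a medium of index mu."""
    return math.degrees(math.atan(mu))

mu_glass = 1.5
i_b = brewster_angle_deg(mu_glass)

# Angle of refraction from Snell's law: sin(i) = mu * sin(r)
r = math.degrees(math.asin(math.sin(math.radians(i_b)) / mu_glass))

print(f"Brewster angle i_B  = {i_b:.2f} deg")
print(f"angle of refraction = {r:.2f} deg")
print(f"i_B + r             = {i_b + r:.2f} deg")
```

For μ = 1.5 the arctangent evaluates to about 56.3°, and the sum i_B + r comes out as 90°, as the derivation requires.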

For simplicity, we have discussed scattering of light by 90°, and reflection 
at the Brewster angle. In this special situation, one of the two 
perpendicular components of the electric field is zero. At other angles, 
both components are present but one is stronger than the other. There is 
no stable phase relationship between the two perpendicular components
since these are derived from two perpendicular components of an
unpolarised beam. When such light is viewed through a rotating analyser, 
one sees a maximum and a minimum of intensity but not complete 
darkness. This kind of light is called partially polarised. 

Let us try to understand the situation. When an unpolarised beam of light 
is incident at the Brewster’s angle on an interface of two media, only part 
of light with electric field vector perpendicular to the plane of incidence 
will be reflected. Now by using a good polariser, if we completely remove 
all the light with its electric vector perpendicular to the plane of incidence 
and let this light be incident on the surface of the prism at Brewster’s 
angle, you will then observe no reflection and there will be total 
transmission of light. 

We began this chapter by pointing out that there are some phenomena 
which can be explained only by the wave theory. In order to develop a 
proper understanding, we first described how some phenomena like 
reflection and refraction, which were studied on the basis of Ray Optics
in Chapter 9, can also be understood on the basis of Wave Optics. Then 
we described Young’s double slit experiment which was a turning point in 
the study of optics. Finally, we described some associated points such as 
diffraction, resolution, polarisation, and validity of ray optics. In the next 
chapter, you will see how new experiments led to new theories at the turn 
of the century around 1900 A.D. 


Summary 

1. Huygens’ principle tells us that each point on a wavefront is a source of secondary waves, 
which add up to give the wavefront at a later time. 


2. Huygens’ construction tells us that the new wavefront is the forward envelope of the
secondary waves. When the speed of light is independent of direction, the secondary waves
are spherical. The rays are then perpendicular to both the wavefronts and the time of travel is
the same measured along any ray. This principle leads to the well known laws of reflection
and refraction.

3. The principle of superposition of waves applies whenever two or more sources of light 
illuminate the same point. When we consider the intensity of light due to these sources at the 
given point, there is an interference term in addition to the sum of the individual intensities. 
But this term is important only if it has a non-zero average, which occurs only if the sources 
have the same frequency and a stable phase difference. 

4. Young’s double slit of separation d gives equally spaced fringes of angular separation λ/d.
The source, mid-point of the slits, and central bright fringe lie in a straight line. An extended
source will destroy the fringes if it subtends angle more than λ/d at the slits.


5. A single slit of width a gives a diffraction pattern with a central maximum. The intensity falls
to zero at angles of ±λ/a, ±2λ/a, etc., with successively weaker secondary maxima in
between. Diffraction limits the angular resolution of a telescope to λ/D where D is the
diameter. Two stars closer than this give strongly overlapping images. Similarly, a microscope
objective subtending angle 2β at the focus, in a medium of refractive index n, will just
separate two objects spaced at a distance λ/(2n sin β), which is the resolution limit of a
microscope. Diffraction determines the limitations of the concept of light rays. A beam of
width a travels a distance a²/λ, called the Fresnel distance, before it starts to spread out due
to diffraction.


6. Natural light, e.g., from the sun is unpolarised. This means the
electric vector takes all possible directions in the transverse plane,
rapidly and randomly, during a measurement. A polaroid transmits
only one component (parallel to a special axis). The resulting light is
called linearly polarised or plane polarised. When this kind of light is
viewed through a second polaroid whose axis turns through 2π, two
maxima and minima of intensity are seen. Polarised light can also be
produced by reflection at a special angle (called the Brewster angle)
and by scattering through π/2 in the earth’s atmosphere.
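
The formulas collected in points 4 and 5 of the summary can be evaluated with a short script. This is a sketch for orientation only: the numerical inputs (500 nm light, a 0.5 mm slit separation, a 3 mm aperture, a 10 cm telescope objective, a microscope in air with β = 30°) are assumed illustrative values, and the function names are hypothetical.

```python
import math

def fringe_angular_separation(wavelength, d):
    """Angular separation of Young's fringes, lambda/d (radians)."""
    return wavelength / d

def single_slit_minima(wavelength, a, orders=(1, 2, 3)):
    """Angles n*lambda/a (radians) at which the single-slit intensity is zero."""
    return [n * wavelength / a for n in orders]

def telescope_resolution(wavelength, diameter):
    """Diffraction-limited angular resolution, lambda/D (radians)."""
    return wavelength / diameter

def microscope_resolution(wavelength, n, beta_deg):
    """Smallest resolvable separation, lambda / (2 n sin(beta))."""
    return wavelength / (2 * n * math.sin(math.radians(beta_deg)))

def fresnel_distance(a, wavelength):
    """Distance a^2/lambda over which a beam of width a stays ray-like."""
    return a ** 2 / wavelength

wavelength = 500e-9  # assumed: 500 nm light

print("fringe separation :", fringe_angular_separation(wavelength, 0.5e-3), "rad")
print("slit minima       :", single_slit_minima(wavelength, 0.1e-3), "rad")
print("telescope limit   :", telescope_resolution(wavelength, 0.10), "rad")
print("microscope limit  :", microscope_resolution(wavelength, 1.0, 30.0), "m")
print("Fresnel distance  :", fresnel_distance(3e-3, wavelength), "m")
```

With these inputs the Fresnel distance is about 18 m, which shows why ray optics works well for millimetre-sized apertures over laboratory distances.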


Points to Ponder 

1. Waves from a point source spread out in all directions, while light was seen to travel along 
narrow rays. It required the insight and experiment of Huygens, Young and Fresnel to 
understand how a wave theory could explain all aspects of the behaviour of light. 

2. The crucial new feature of waves is interference of amplitudes from different sources which 
can be both constructive and destructive, as shown in Young’s experiment. 

3. Diffraction phenomena define the limits of ray optics. The limit of the ability of microscopes 
and telescopes to distinguish very close objects is set by the wavelength of light. 

4. Most interference and diffraction effects exist even for longitudinal waves like sound in air. 
But polarisation phenomena are special to transverse waves like light waves. 



Exercises 

10.1 Monochromatic light of wavelength 589 nm is incident from air 
on a water surface. What are the wavelength, frequency and speed 
of (a) reflected, and (b) refracted light? Refractive index of water is 
1.33. 

10.2 What is the shape of the wavefront in each of the following 
cases: 

(a) Light diverging from a point source. 

(b) Light emerging out of a convex lens when a point source is placed 
at its focus. 

(c) The portion of the wavefront of light from a distant star intercepted
by the Earth.

10.3 (a) The refractive index of glass is 1.5. What is the speed of light
in glass? (Speed of light in vacuum is 3.0 × 10⁸ m s⁻¹)

(b) Is the speed of light in glass independent of the colour of light? If 
not, which of the two colours red and violet travels slower in a glass 
prism? 

10.4 In a Young’s double-slit experiment, the slits are separated by
0.28 mm and the screen is placed 1.4 m away. The distance between
the central bright fringe and the fourth bright fringe is measured
to be 1.2 cm. Determine the wavelength of light used in the
experiment.

10.5 In Young’s double-slit experiment using monochromatic light of
wavelength λ, the intensity of light at a point on the screen where
path difference is λ, is K units. What is the intensity of light at a point
where path difference is λ/3?

10.6 A beam of light consisting of two wavelengths, 650 nm and 520 
nm, is used to obtain interference fringes in a Young’s double-slit 
experiment. 

(a) Find the distance of the third bright fringe on the screen from the 
central maximum for wavelength 650 nm. 

(b) What is the least distance from the central maximum where the 
bright fringes due to both the wavelengths coincide? 

10.7 In a double-slit experiment the angular width of a fringe is found 
to be 0.2° on a screen placed 1 m away. The wavelength of light used 
is 600 nm. What will be the angular width of the fringe if the entire 
experimental apparatus is immersed in water? Take refractive index 
of water to be 4/3. 




10.8 What is the Brewster angle for air to glass transition? (Refractive 
index of glass = 1.5.) 

10.9 Light of wavelength 5000 Å falls on a plane reflecting surface.
What are the wavelength and frequency of the reflected light? For 
what angle of incidence is the reflected ray normal to the incident 
ray? 

10.10 Estimate the distance for which ray optics is a good
approximation for an aperture of 4 mm and wavelength 400 nm.

Additional Exercises 

10.11 The 6563 Å Hα line emitted by hydrogen in a star is found to be
red-shifted by 15 Å. Estimate the speed with which the star is
receding from the Earth.

10.12 Explain how Corpuscular theory predicts the speed of light in a 
medium, say, water, to be greater than the speed of light in vacuum. 
Is the prediction confirmed by experimental determination of the 
speed of light in water? If not, which alternative picture of light is 
consistent with experiment? 

10.13 You have learnt in the text how Huygens’ principle leads to the 
laws of reflection and refraction. Use the same principle to deduce 
directly that a point object placed in front of a plane mirror produces a 
virtual image whose distance from the mirror is equal to the object 
distance from the mirror. 

10.14 Let us list some of the factors, which could possibly influence
the speed of wave propagation:

(i) nature of the source.

(ii) direction of propagation.

(iii) motion of the source and/or observer.

(iv) wavelength.

(v) intensity of the wave.

On which of these factors, if any, does

(a) the speed of light in vacuum,

(b) the speed of light in a medium (say, glass or water),

depend?

10.15 For sound waves, the Doppler formula for frequency shift 
differs slightly between the two situations: (i) source at rest; observer 
moving, and (ii) source moving; observer at rest. The exact Doppler 
formulas for the case of light waves in vacuum are, however, strictly 
identical for these situations. Explain why this should be so. Would 
you expect the formulas to be strictly identical for the two situations in 
case of light travelling in a medium? 

10.16 In double-slit experiment using light of wavelength 600 nm, the 
angular width of a fringe formed on a distant screen is 0.1°. What is 
the spacing between the two slits? 

10.17 Answer the following questions: 

(a) In a single slit diffraction experiment, the width of the slit is made 
double the original width. How does this affect the size and intensity 
of the central diffraction band? 

(b) In what way is diffraction from each slit related to the interference 
pattern in a double-slit experiment? 

(c) When a tiny circular obstacle is placed in the path of light from a 
distant source, a bright spot is seen at the centre of the shadow of 
the obstacle. Explain why? 

(d) Two students are separated by a 7 m partition wall in a room
10 m high. If both light and sound waves can bend around obstacles,
how is it that the students are unable to see each other even though
they can converse easily?

(e) Ray optics is based on the assumption that light travels in a 
straight line. Diffraction effects (observed when light propagates 
through small apertures/slits or around small obstacles) disprove this 
assumption. Yet the ray optics assumption is so commonly used in 
understanding location and several other properties of images in 
optical instruments. What is the justification? 

10.18 Two towers on top of two hills are 40 km apart. The line joining 
them passes 50 m above a hill halfway between the towers. What is 
the longest wavelength of radio waves, which can be sent between 
the towers without appreciable diffraction effects? 

10.19 A parallel beam of light of wavelength 500 nm falls on a narrow 
slit and the resulting diffraction pattern is observed on a screen 1 m 
away. It is observed that the first minimum is at a distance of 2.5 mm 
from the centre of the screen. Find the width of the slit. 

10.20 Answer the following questions: 

(a) When a low flying aircraft passes overhead, we sometimes notice 
a slight shaking of the picture on our TV screen. Suggest a possible 
explanation. 

(b) As you have learnt in the text, the principle of linear superposition 
of wave displacement is basic to understanding intensity distributions 
in diffraction and interference patterns. What is the justification of this 
principle? 

10.21 In deriving the single slit diffraction pattern, it was stated that
the intensity is zero at angles of nλ/a. Justify this by suitably dividing
the slit to bring out the cancellation.


Chapter Eleven


Dual Nature of Radiation and 
Matter 


11.1 Introduction 

Maxwell’s equations of electromagnetism and Hertz’s experiments on
the generation and detection of electromagnetic waves in 1887 strongly
established the wave nature of light. Around the same period, at the end
of the 19th century, experimental investigations on conduction of electricity
(electric discharge) through gases at low pressure in a discharge tube led
to many historic discoveries. The discovery of X-rays by Roentgen in
1895, and of the electron by J. J. Thomson in 1897, were important
milestones in the understanding of atomic structure. It was found that at
sufficiently low pressure of about 0.001 mm of mercury column, a 
discharge took place between the two electrodes on applying the electric 
field to the gas in the discharge tube. A fluorescent glow appeared on the 
glass opposite to cathode. The colour of glow of the glass depended on 
the type of glass, it being yellowish-green for soda glass. The cause of 
this fluorescence was attributed to the radiation which appeared to be 
coming from the cathode. These cathode rays were discovered, in 1870, 
by William Crookes who later, in 1879, suggested that these rays 
consisted of streams of fast moving negatively charged particles. The 
British physicist J. J. Thomson (1856-1940) confirmed this hypothesis. By 
applying mutually perpendicular electric and magnetic fields across the 
discharge tube, J. J. Thomson was the first to determine experimentally 
the speed and the specific charge [charge to mass ratio (e/m)] of the
cathode ray particles. They were found to travel with speeds ranging from
about 0.1 to 0.2 times the speed of light (3 × 10⁸ m/s). The presently
accepted value of e/m is 1.76 × 10¹¹ C/kg. Further, the value of e/m was
found to be independent of the nature of the material/metal used as the 
cathode (emitter), or the gas introduced in the discharge tube. This 
observation suggested the universality of the cathode ray particles. 

Around the same time, in 1887, it was found that certain metals, when 
irradiated by ultraviolet light, emitted negatively charged particles having 
small speeds. Also, certain metals when heated to a high temperature 
were found to emit negatively charged particles. The value of e/m of 
these particles was found to be the same as that for cathode ray 
particles. These observations thus established that all these particles, 
although produced under different conditions, were identical in nature. J. 
J. Thomson, in 1897, named these particles as electrons, and suggested 
that they were fundamental, universal constituents of matter. For his 
epoch-making discovery of the electron, through his theoretical and
experimental investigations on conduction of electricity by gases, he
was awarded the Nobel Prize in Physics in 1906. In 1913, the American
physicist R. A. Millikan (1868-1953) performed the pioneering oil-drop
experiment for the precise measurement of the charge on an electron. He
found that the charge on an oil-droplet was always an integral multiple of
an elementary charge, 1.602 × 10⁻¹⁹ C. Millikan’s experiment established
that electric charge is quantised. From the values of charge (e) and 
specific charge (e/m), the mass (m) of the electron could be determined. 
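
The last step, obtaining the electron mass from e and e/m, is a one-line division. The sketch below uses the two values quoted in this section; the variable names are illustrative.

```python
# Values quoted in the text
ELEMENTARY_CHARGE = 1.602e-19   # e, in coulomb (Millikan)
SPECIFIC_CHARGE = 1.76e11       # e/m, in C/kg (Thomson)

# m = e / (e/m)
electron_mass = ELEMENTARY_CHARGE / SPECIFIC_CHARGE

print(f"electron mass = {electron_mass:.3e} kg")
```

The result is close to 9.1 × 10⁻³¹ kg, the familiar value of the electron mass.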

11.2 Electron Emission 

We know that metals have free electrons (negatively charged particles) 
that are responsible for their conductivity. However, the free electrons
cannot normally escape out of the metal surface. If an electron attempts
to come out of the metal, the metal surface acquires a positive charge
and pulls the electron back to the metal. The free electron is thus held
inside the metal surface by the attractive forces of the ions.
Consequently, the electron can come out of the metal surface only if it
has got sufficient energy to overcome the attractive pull. A certain
minimum amount of energy is required to be given to an electron to pull it
out from the surface of the metal. This minimum energy required by an
electron to escape from the metal surface is called the work function of
the metal. It is generally denoted by φ₀ and measured in eV (electron
volt). One electron volt is the energy gained by an electron when it has
been accelerated by a potential difference of 1 volt, so that 1 eV = 1.602
× 10⁻¹⁹ J.

This unit of energy is commonly used in atomic and nuclear physics. The
work function (φ₀) depends on the properties of the metal and the nature
of its surface. The values of work function of some metals are given in
Table 11.1. These values are approximate as they are very sensitive to
surface impurities.

Note from Table 11.1 that the work function of platinum is the highest (φ₀
= 5.65 eV) while it is the lowest (φ₀ = 2.14 eV) for caesium.
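
To get a feel for the size of these energies in SI units, the work functions of Table 11.1 can be converted from electron volts to joules using 1 eV = 1.602 × 10⁻¹⁹ J. The following is a small sketch; the dictionary and function names are illustrative.

```python
EV_IN_JOULE = 1.602e-19  # 1 eV expressed in joule, as quoted in the text

# Work functions from Table 11.1, in eV
WORK_FUNCTION_EV = {
    "Cs": 2.14, "K": 2.30, "Na": 2.75, "Ca": 3.20, "Mo": 4.17, "Pb": 4.25,
    "Al": 4.28, "Hg": 4.49, "Cu": 4.65, "Ag": 4.70, "Ni": 5.15, "Pt": 5.65,
}

def ev_to_joule(energy_ev):
    """Convert an energy from electron volts to joules."""
    return energy_ev * EV_IN_JOULE

lowest = min(WORK_FUNCTION_EV, key=WORK_FUNCTION_EV.get)
highest = max(WORK_FUNCTION_EV, key=WORK_FUNCTION_EV.get)

# List the metals in order of increasing work function
for metal, phi in sorted(WORK_FUNCTION_EV.items(), key=lambda item: item[1]):
    print(f"{metal:>2}: {phi:.2f} eV = {ev_to_joule(phi):.3e} J")

print(f"lowest: {lowest}, highest: {highest}")
```

Every entry is of the order of 10⁻¹⁹ J, which is why the electron volt is the more convenient unit here.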

The minimum energy required for the electron emission from the metal 
surface can be supplied to the free electrons by any one of the following 
physical processes: 



Table 11.1 Work functions of some metals

Metal    Work function φ₀ (eV)    Metal    Work function φ₀ (eV)
Cs       2.14                     Al       4.28
K        2.30                     Hg       4.49
Na       2.75                     Cu       4.65
Ca       3.20                     Ag       4.70
Mo       4.17                     Ni       5.15
Pb       4.25                     Pt       5.65


(i) Thermionic emission: By suitably heating, sufficient thermal energy 
can be imparted to the free electrons to enable them to come out of the 
metal. 

(ii) Field emission: By applying a very strong electric field (of the order of
10⁸ V m⁻¹) to a metal, electrons can be pulled out of the metal, as in a
spark plug.

(iii) Photoelectric emission: When light of suitable frequency illuminates a
metal surface, electrons are emitted from the metal surface. These 
photo(light)-generated electrons are called photoelectrons. 

11.3 Photoelectric Effect 

11.3.1 Hertz’s observations 

The phenomenon of photoelectric emission was discovered in 1887 by 
Heinrich Hertz (1857-1894), during his electromagnetic wave 
experiments. In his experimental investigation on the production of 
electromagnetic waves by means of a spark discharge, Hertz observed 
that high voltage sparks across the detector loop were enhanced when 
the emitter plate was illuminated by ultraviolet light from an arc lamp. 











Light shining on the metal surface somehow facilitated the escape of 
free, charged particles which we now know as electrons. When light falls 
on a metal surface, some electrons near the surface absorb enough 
energy from the incident radiation to overcome the attraction of the 
positive ions in the material of the surface. After gaining sufficient energy 
from the incident light, the electrons escape from the surface of the metal 
into the surrounding space. 

11.3.2 Hallwachs’ and Lenard’s observations 

Wilhelm Hallwachs and Philipp Lenard investigated the phenomenon of 
photoelectric emission in detail during 1886-1902. 

Lenard (1862-1947) observed that when ultraviolet radiations were 
allowed to fall on the emitter plate of an evacuated glass tube enclosing 
two electrodes (metal plates), current flowed in the circuit (Fig. 11.1). As
soon as the ultraviolet radiations were stopped, the current flow also 
stopped. These observations indicate that when ultraviolet radiations fall 
on the emitter plate C, electrons are ejected from it which are attracted 
towards the positive, collector plate A by the electric field. The electrons 
flow through the evacuated glass tube, resulting in the current flow. Thus, 
light falling on the surface of the emitter causes current in the external 
circuit. Hallwachs and Lenard studied how this photo current varied with 
collector plate potential, and with frequency and intensity of incident light. 

Hallwachs, in 1888, undertook the study further and connected a 
negatively charged zinc plate to an electroscope. He observed that the 
zinc plate lost its charge when it was illuminated by ultraviolet light. 
Further, the uncharged zinc plate became positively charged when it was 
irradiated by ultraviolet light. Positive charge on a positively charged zinc 
plate was found to be further enhanced when it was illuminated by
ultraviolet light. From these observations he concluded that negatively
charged particles were emitted from the zinc plate under the action of 
ultraviolet light. 

After the discovery of the electron in 1897, it became evident that the 
incident light causes electrons to be emitted from the emitter plate. Due 
to negative charge, the emitted electrons are pushed towards the 
collector plate by the electric field. Hallwachs and Lenard also observed 
that when ultraviolet light fell on the emitter plate, no electrons were 
emitted at all when the frequency of the incident light was smaller than a 
certain minimum value, called the threshold frequency. This minimum 
frequency depends on the nature of the material of the emitter plate. 

It was found that certain metals like zinc, cadmium, magnesium, etc., 
responded only to ultraviolet light, having short wavelength, to cause 
electron emission from the surface. However, some alkali metals such as 
lithium, sodium, potassium, caesium and rubidium were sensitive even to 
visible light. All these photosensitive substances emit electrons when 
they are illuminated by light. After the discovery of electrons, these 
electrons were termed as photoelectrons. The phenomenon is called 
photoelectric effect. 

11.4 Experimental Study of Photoelectric Effect 

Figure 11.1 depicts a schematic view of the arrangement used for the 
experimental study of the photoelectric effect. It consists of an evacuated 
glass/quartz tube having a photosensitive plate C and another metal 
plate A. Monochromatic light from the source S of sufficiently short 
wavelength passes through the window W and falls on the photosensitive 
plate C (emitter). A transparent quartz window is sealed on to the glass 
tube, which permits ultraviolet radiation to pass through it and irradiate 
the photosensitive plate C. The electrons are emitted by the plate C and
are collected by the plate A (collector), by the electric field created by the
battery. The battery maintains the potential difference between the plates
C and A, that can be varied. The polarity of the plates C and A can be
reversed by a commutator. Thus, the plate A can be maintained at a
desired positive or negative potential with respect to emitter C. When the
collector plate A is positive with respect to the emitter plate C, the
electrons are attracted to it. The emission of electrons causes flow of
electric current in the circuit. The potential difference between the emitter
and collector plates is measured by a voltmeter (V) whereas the resulting
photo current flowing in the circuit is measured by a microammeter (μA).
The photoelectric current can be increased or decreased by varying the
potential of collector plate A with respect to the emitter plate C. The
intensity and frequency of the incident light can be varied, as can the
potential difference V between the emitter C and the collector A.




11.4.1 Effect of intensity of light on photocurrent 
























The collector A is maintained at a positive potential with respect to emitter 
C so that electrons ejected from C are attracted towards collector A. 
Keeping the frequency of the incident radiation and the potential fixed, 
the intensity of light is varied and the resulting photoelectric current is 
measured each time. It is found that the photocurrent increases linearly 
with intensity of incident light as shown graphically in Fig. 11.2. The 
photocurrent is directly proportional to the number of photoelectrons 
emitted per second. This implies that the number of photoelectrons 
emitted per second is directly proportional to the intensity of incident 
radiation. 



Figure 11.2 Variation of photoelectric current with intensity of light (horizontal axis: intensity of light).


11.4.2 Effect of potential on photoelectric current 

We first keep the plate A at some positive potential with respect to the 
plate C and illuminate the plate C with light of fixed frequency $\nu$ and fixed
intensity $I_1$. We next vary the positive potential of plate A gradually and

measure the resulting photocurrent each time. It is found that the 






photoelectric current increases with increase in positive (accelerating) 
potential. At some stage, for a certain positive potential of plate A, all the 
emitted electrons are collected by the plate A and the photoelectric 
current becomes maximum or saturates. If we increase the accelerating 
potential of plate A further, the photocurrent does not increase. This 
maximum value of the photoelectric current is called saturation current. 
Saturation current corresponds to the case when all the photoelectrons 
emitted by the emitter plate C reach the collector plate A. 

We now apply a negative (retarding) potential to the plate A with respect 
to the plate C and make it increasingly negative gradually. When the 
polarity is reversed, the electrons are repelled and only the most 
energetic electrons are able to reach the collector A. The photocurrent is 
found to decrease rapidly until it drops to zero at a certain sharply 
defined, critical value of the negative potential $V_0$ on the plate A. For a
particular frequency of incident radiation, the minimum negative
(retarding) potential $V_0$ given to the plate A for which the photocurrent
stops or becomes zero is called the cut-off or stopping potential.




Figure 11.3 Variation of photocurrent with collector plate potential for different
intensity of incident radiation (horizontal axis: collector plate potential, with the retarding potential on the negative side).

The interpretation of the observation in terms of photoelectrons is
straightforward. Not all the photoelectrons emitted from the metal have
the same energy. Photoelectric current is zero when the stopping
potential is sufficient to repel even the most energetic photoelectrons,
with the maximum kinetic energy ($K_{\max}$), so that

$K_{\max} = e V_0$ (11.1)

We can now repeat this experiment with incident radiation of the same
frequency but of higher intensity $I_2$ and $I_3$ ($I_3 > I_2 > I_1$). We note that the
saturation currents are now found to be at higher values. This shows that
more electrons are being emitted per second, proportional to the intensity
of incident radiation. But the stopping potential remains the same as that
for the incident radiation of intensity $I_1$, as shown graphically in Fig. 11.3.










Thus, for a given frequency of the incident radiation, the stopping 
potential is independent of its intensity. In other words, the maximum 
kinetic energy of photoelectrons depends on the light source and the 
emitter plate material, but is independent of intensity of incident radiation. 

11.4.3 Effect of frequency of incident radiation on stopping potential 

We now study the relation between the frequency $\nu$ of the incident
radiation and the stopping potential $V_0$. We adjust the light to the same
intensity at various frequencies and study the variation of
photocurrent with collector plate potential. The resulting variation is
shown in Fig. 11.4. We obtain different values of stopping potential but
the same value of the saturation current for incident radiation of different
frequencies. The energy of the emitted electrons depends on the
frequency of the incident radiation. The stopping potential is more
negative for higher frequencies of incident radiation. Note from Fig. 11.4
that the stopping potentials are in the order $V_{03} > V_{02} > V_{01}$ if the
frequencies are in the order $\nu_3 > \nu_2 > \nu_1$. This implies that the greater the
frequency of incident light, the greater is the maximum kinetic energy of the
photoelectrons. Consequently, we need greater retarding potential to stop
them completely. If we plot a graph between the frequency of incident
radiation and the corresponding stopping potential for different metals we
get a straight line, as shown in Fig. 11.5.



Figure 11.4 Variation of photoelectric current with collector plate potential for different frequencies
of incident radiation.

Figure 11.5 Variation of stopping potential $V_0$ with frequency $\nu$ of incident radiation for a given
photosensitive material.


The graph shows that 

(i) the stopping potential $V_0$ varies linearly with the frequency of incident
radiation for a given photosensitive material.

(ii) there exists a certain minimum cut-off frequency $\nu_0$ for which the
stopping potential is zero.









These observations have two implications: 

(i) The maximum kinetic energy of the photoelectrons varies linearly with 
the frequency of incident radiation, but is independent of its intensity. 

(ii) For a frequency $\nu$ of incident radiation, lower than the cut-off
frequency $\nu_0$, no photoelectric emission is possible even if the intensity is
large.

This minimum, cut-off frequency $\nu_0$, is called the threshold frequency. It is
different for different metals.

Different photosensitive materials respond differently to light. Selenium is 
more sensitive than zinc or copper. The same photosensitive substance 
gives different response to light of different wavelengths. For example, 
ultraviolet light gives rise to photoelectric effect in copper while green or 
red light does not. 

Note that in all the above experiments, it is found that, if frequency of the 
incident radiation exceeds the threshold frequency, the photoelectric 
emission starts instantaneously without any apparent time lag, even if the 
incident radiation is very dim. It is now known that emission starts in a
time of the order of $10^{-9}$ s or less.

We now summarise the experimental features and observations 
described in this section. 

(i) For a given photosensitive material and frequency of incident radiation 
(above the threshold frequency), the photoelectric current is directly 
proportional to the intensity of incident light (Fig. 11.2). 

(ii) For a given photosensitive material and frequency of incident
radiation, saturation current is found to be proportional to the intensity of
incident radiation whereas the stopping potential is independent of its
intensity (Fig. 11.3).

(iii) For a given photosensitive material, there exists a certain minimum 
cut-off frequency of the incident radiation, called the threshold frequency, 
below which no emission of photoelectrons takes place, no matter how 
intense the incident light is. Above the threshold frequency, the stopping 
potential or equivalently the maximum kinetic energy of the emitted 
photoelectrons increases linearly with the frequency of the incident 
radiation, but is independent of its intensity (Fig. 11.5). 

(iv) The photoelectric emission is an instantaneous process without any
apparent time lag ($\sim 10^{-9}$ s or less), even when the incident radiation is
made exceedingly dim.


11.5 Photoelectric Effect and Wave Theory of Light

The wave nature of light was well established by the end of the 
nineteenth century. The phenomena of interference, diffraction and 
polarisation were explained in a natural and satisfactory way by the wave 
picture of light. According to this picture, light is an electromagnetic wave 
consisting of electric and magnetic fields with continuous distribution of 
energy over the region of space over which the wave is extended. Let us 
now see if this wave picture of light can explain the observations on 
photoelectric emission given in the previous section. 

According to the wave picture of light, the free electrons at the surface of 
the metal (over which the beam of radiation falls) absorb the radiant 
energy continuously. The greater the intensity of radiation, the greater is
the amplitude of the electric and magnetic fields. Consequently, the greater
the intensity, the greater should be the energy absorbed by each
electron. In this picture, the maximum kinetic energy of the
photoelectrons on the surface is then expected to increase with increase
in intensity. Also, no matter what the frequency of radiation is, a
sufficiently intense beam of radiation (over sufficient time) should be able
to impart enough energy to the electrons, so that they exceed the
minimum energy needed to escape from the metal surface. A threshold
frequency, therefore, should not exist. These expectations of the wave
theory directly contradict observations (i), (ii) and (iii) given at the end of
sub-section 11.4.3.

Further, we should note that in the wave picture, the absorption of energy 
by electron takes place continuously over the entire wavefront of the 
radiation. Since a large number of electrons absorb energy, the energy 
absorbed per electron per unit time turns out to be small. Explicit 
calculations estimate that it can take hours or more for a single electron 
to pick up sufficient energy to overcome the work function and come out 
of the metal. This conclusion is again in striking contrast to observation 
(iv) that the photoelectric emission is instantaneous. In short, the wave 
picture is unable to explain the most basic features of photoelectric 
emission. 
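The size of the classical time lag can be sketched numerically. The following is only an order-of-magnitude estimate under assumptions that are not in the text: a dim point source of 1 mW at a distance of 1 m, and an electron that collects energy over a disc of atomic radius (about $10^{-10}$ m). The work function is the caesium value quoted in Example 11.2; the function name is hypothetical.

```python
import math

# Illustrative assumptions (not values from the text).
SOURCE_POWER_W = 1.0e-3      # assumed dim point source
DISTANCE_M = 1.0             # assumed source-to-plate distance
ATOM_RADIUS_M = 1.0e-10      # assumed collecting radius for one electron
WORK_FUNCTION_EV = 2.14      # caesium, as quoted in Example 11.2
EV_TO_J = 1.6e-19            # 1 eV in joules


def classical_time_lag(power_w, distance_m, radius_m, work_function_ev):
    """Time for one electron to accumulate the work function in the wave picture."""
    intensity = power_w / (4.0 * math.pi * distance_m ** 2)   # W/m^2 at the plate
    absorbed_power = intensity * math.pi * radius_m ** 2      # W collected per electron
    return work_function_ev * EV_TO_J / absorbed_power        # seconds


lag_s = classical_time_lag(SOURCE_POWER_W, DISTANCE_M, ATOM_RADIUS_M, WORK_FUNCTION_EV)
print(f"classical estimate: {lag_s:.3g} s (about {lag_s / 3600:.0f} hours)")
```

With these assumed numbers the wave picture predicts a delay of many hours, whereas the observed delay is of the order of $10^{-9}$ s or less.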

11.6 Einstein’s Photoelectric Equation: Energy Quantum of 
Radiation 

In 1905, Albert Einstein (1879-1955) proposed a radically new picture of 
electromagnetic radiation to explain photoelectric effect. In this picture, 
photoelectric emission does not take place by continuous absorption of 
energy from radiation. Radiation energy is built up of discrete units - the
so-called quanta of energy of radiation. Each quantum of radiant energy
has energy $h\nu$, where $h$ is Planck’s constant and $\nu$ the frequency of light.
In photoelectric effect, an electron absorbs a quantum of energy ($h\nu$) of
radiation. If this quantum of energy absorbed exceeds the minimum
energy needed for the electron to escape from the metal surface (work
function $\phi_0$), the electron is emitted with maximum kinetic energy

$K_{\max} = h\nu - \phi_0$ (11.2)

More tightly bound electrons will emerge with kinetic energies less than 
the maximum value. Note that the intensity of light of a given frequency is 
determined by the number of photons incident per second. Increasing the 
intensity will increase the number of emitted electrons per second. 
However, the maximum kinetic energy of the emitted photoelectrons is 
determined by the energy of each photon. 
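As a minimal sketch of how the photoelectric equation is used, the function below evaluates the maximum kinetic energy as photon energy minus work function, and returns None when the photon energy is below the work function. The function name and the sample inputs (a frequency of $6.0 \times 10^{14}$ Hz and a work function of 2.14 eV) are illustrative choices; the constants are the rounded values used in this chapter's examples.

```python
H = 6.63e-34        # Planck's constant in J s (rounded value used in this chapter)
EV_TO_J = 1.6e-19   # 1 eV in joules (rounded value used in this chapter)


def max_kinetic_energy_ev(frequency_hz, work_function_ev):
    """Return the maximum kinetic energy in eV, or None if no emission is possible."""
    photon_energy_ev = H * frequency_hz / EV_TO_J
    if photon_energy_ev < work_function_ev:
        return None   # below threshold: no emission, whatever the intensity
    return photon_energy_ev - work_function_ev


k_max = max_kinetic_energy_ev(6.0e14, 2.14)
below_threshold = max_kinetic_energy_ev(4.0e14, 2.14)
print(k_max, below_threshold)
```

Note that intensity does not appear anywhere in the function: it affects only how many electrons are emitted per second, not the energy of each.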


Albert Einstein (1879 - 1955)

Einstein, one of the greatest physicists of all time, was born in
Ulm, Germany. In 1905, he published three path-breaking papers. In the first paper, he
introduced the notion of light quanta (now called photons) and used it to explain the features
of photoelectric effect. In the second paper, he developed a theory of Brownian motion, which was
confirmed experimentally a few years later and provided convincing evidence of the atomic
picture of matter. The third paper gave birth to the special theory of relativity. In 1916, he
published the general theory of relativity. Some of Einstein’s most significant later
contributions are: the notion of stimulated emission introduced in an alternative derivation of
Planck’s blackbody radiation law, static model of the universe which started modern
cosmology, quantum statistics of a gas of massive bosons, and a critical analysis of the
foundations of quantum mechanics. In 1921, he was awarded the Nobel Prize in physics for
his contribution to theoretical physics and the photoelectric effect.


Equation (11.2) is known as Einstein’s photoelectric equation. We now
see how this equation accounts, in a simple and elegant manner, for all the
observations on photoelectric effect given at the end of sub-section
11.4.3.

• According to Eq. (11.2), $K_{\max}$ depends linearly on $\nu$, and is independent
of intensity of radiation, in agreement with observation. This is
so because in Einstein’s picture, photoelectric effect arises from
the absorption of a single quantum of radiation by a single electron. The 
intensity of radiation (that is proportional to the number of energy quanta 
per unit area per unit time) is irrelevant to this basic process. 

• Since $K_{\max}$ must be non-negative, Eq. (11.2) implies that photoelectric
emission is possible only if

$h\nu > \phi_0$

or $\nu > \nu_0$, where

$\nu_0 = \dfrac{\phi_0}{h}$ (11.3)

Equation (11.3) shows that the greater the work function $\phi_0$, the higher
the minimum or threshold frequency $\nu_0$ needed to emit photoelectrons.
Thus, there exists a threshold frequency $\nu_0$ (= $\phi_0/h$) for the metal surface,
below which no photoelectric emission is possible, no matter how intense
the incident radiation may be or how long it falls on the surface.

• In this picture, intensity of radiation as noted above, is proportional to 
the number of energy quanta per unit area per unit time. The greater the 
number of energy quanta available, the greater is the number of 
electrons absorbing the energy quanta and greater, therefore, is the 
number of electrons coming out of the metal (for $\nu > \nu_0$). This explains
why, for $\nu > \nu_0$, photoelectric current is proportional to intensity.

• In Einstein’s picture, the basic elementary process involved in 
photoelectric effect is the absorption of a light quantum by an electron. 
This process is instantaneous. Thus, whatever may be the intensity i.e., 
the number of quanta of radiation per unit area per unit time, 
photoelectric emission is instantaneous. Low intensity does not mean 
delay in emission, since the basic elementary process is the same. 
Intensity only determines how many electrons are able to participate in 
the elementary process (absorption of a light quantum by a single 
electron) and, therefore, the photoelectric current. 

Using Eq. (11.1), the photoelectric equation, Eq. (11.2), can be written as

$e V_0 = h\nu - \phi_0$; for $\nu > \nu_0$

or $V_0 = \left(\dfrac{h}{e}\right)\nu - \dfrac{\phi_0}{e}$ (11.4)

This is an important result. It predicts that the $V_0$ versus $\nu$ curve is a
straight line with slope = ($h/e$), independent of the nature of the material.
During 1906-1916, Millikan performed a series of experiments on 
photoelectric effect, aimed at disproving Einstein’s photoelectric equation. 
He measured the slope of the straight line obtained for sodium, similar to 
that shown in Fig. 11.5. Using the known value of e, he determined the 
value of Planck’s constant $h$. This value was close to the value of
Planck’s constant (= $6.626 \times 10^{-34}$ J s) determined in an entirely different
context. In this way, in 1916, Millikan proved the validity of Einstein’s
photoelectric equation, instead of disproving it.
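The way Planck's constant follows from the slope of the stopping potential versus frequency line can be sketched with a least-squares fit. The data below are synthetic, generated from the linear relation itself with an assumed work function of 2.14 eV (the caesium value from Example 11.2); they are not Millikan's measurements, and the helper name fit_line is hypothetical.

```python
H_TRUE = 6.63e-34    # J s, used only to generate the synthetic data
E_CHARGE = 1.6e-19   # C
PHI0_EV = 2.14       # assumed work function in eV

# Synthetic (frequency, stopping potential) pairs; not measured data.
frequencies = [6.0e14, 7.0e14, 8.0e14, 9.0e14, 1.0e15]
stopping_v = [H_TRUE * f / E_CHARGE - PHI0_EV for f in frequencies]


def fit_line(xs, ys):
    """Ordinary least-squares slope and intercept for y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


slope, intercept = fit_line(frequencies, stopping_v)
h_estimate = slope * E_CHARGE        # slope of the line is h/e
phi0_estimate_ev = -intercept        # intercept is -phi_0/e, numerically phi_0 in eV
threshold_hz = -intercept / slope    # frequency at which the stopping potential is zero
print(h_estimate, phi0_estimate_ev, threshold_hz)
```

Because the slope depends only on $h$ and $e$, the same fit applied to a different metal would change the intercept and the threshold frequency but not the slope.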

The successful explanation of photoelectric effect using the hypothesis of 
light quanta and the experimental determination of values of $h$ and $\phi_0$, in
agreement with values obtained from other experiments, led to the
acceptance of Einstein’s picture of photoelectric effect. Millikan verified 
photoelectric equation with great precision, for a number of alkali metals 
over a wide range of radiation frequencies. 

11.7 Particle Nature of Light: The Photon 

Photoelectric effect thus gave evidence to the strange fact that light in 
interaction with matter behaved as if it was made of quanta or packets of 
energy, each of energy $h\nu$.

Is the light quantum of energy to be associated with a particle? Einstein 
arrived at the important result, that the light quantum can also be 
associated with momentum ($h\nu/c$). A definite value of energy as well as
momentum is a strong sign that the light quantum can be associated with 
a particle. This particle was later named photon. The particle-like 
behaviour of light was further confirmed, in 1924, by the experiment of 
A.H. Compton (1892-1962) on scattering of X-rays from electrons. In 
1921, Einstein was awarded the Nobel Prize in Physics for his 
contribution to theoretical physics and the photoelectric effect. In 1923, 
Millikan was awarded the Nobel Prize in physics for his work on the 
elementary charge of electricity and on the photoelectric effect. 

We can summarise the photon picture of electromagnetic radiation as 
follows: 


(i) In interaction of radiation with matter, radiation behaves as if it is made 
up of particles called photons. 

(ii) Each photon has energy $E$ (= $h\nu$) and momentum $p$ (= $h\nu/c$), and
speed $c$, the speed of light.

(iii) All photons of light of a particular frequency $\nu$, or wavelength $\lambda$, have
the same energy $E$ (= $h\nu = hc/\lambda$) and momentum $p$ (= $h\nu/c = h/\lambda$), whatever
the intensity of radiation may be. By increasing the intensity of light of 
given wavelength, there is only an increase in the number of photons per 
second crossing a given area, with each photon having the same energy. 
Thus, photon energy is independent of intensity of radiation. 

(iv) Photons are electrically neutral and are not deflected by electric and 
magnetic fields. 

(v) In a photon-particle collision (such as photon-electron collision), the 
total energy and total momentum are conserved. However, the number of 
photons may not be conserved in a collision. The photon may be 
absorbed or a new photon may be created. 
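Points (ii) and (iii) of this summary can be sketched as a short calculation. The function name and the sample wavelength of 550 nm are illustrative; the constants are the rounded values used in this chapter's examples.

```python
H = 6.63e-34   # Planck's constant, J s
C = 3.0e8      # speed of light, m/s


def photon_energy_and_momentum(wavelength_m):
    """Return (energy in J, momentum in kg m/s) for one photon of this wavelength."""
    energy = H * C / wavelength_m    # E = h*nu = h*c/lambda
    momentum = H / wavelength_m      # p = h*nu/c = h/lambda
    return energy, momentum


energy_j, momentum = photon_energy_and_momentum(550e-9)
print(energy_j, momentum)
```

Neither quantity depends on the intensity of the beam; a brighter beam of the same wavelength simply carries more photons per second.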


Example 11.1 Monochromatic light of frequency $6.0 \times 10^{14}$ Hz is produced by a laser. The
power emitted is $2.0 \times 10^{-3}$ W. (a) What is the energy of a photon in the light beam? (b) How
many photons per second, on an average, are emitted by the source?

Solution

(a) Each photon has an energy
$E = h\nu = (6.63 \times 10^{-34}\ \text{J s})(6.0 \times 10^{14}\ \text{Hz})$
$= 3.98 \times 10^{-19}$ J

(b) If $N$ is the number of photons emitted by the source per second, the power $P$ transmitted
in the beam equals $N$ times the energy per photon $E$, so that $P = N E$. Then

$N = \dfrac{P}{E} = \dfrac{2.0 \times 10^{-3}\ \text{W}}{3.98 \times 10^{-19}\ \text{J}}$
$= 5.0 \times 10^{15}$ photons per second.
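The arithmetic of Example 11.1 can be checked with a few lines. This sketch takes the frequency as $6.0 \times 10^{14}$ Hz and the power as $2.0 \times 10^{-3}$ W, the values consistent with the quoted answers; the variable names are illustrative.

```python
H = 6.63e-34            # Planck's constant, J s (value used in the example)
frequency_hz = 6.0e14   # frequency of the laser light
power_w = 2.0e-3        # power emitted by the laser

photon_energy_j = H * frequency_hz               # part (a): E = h * nu
photons_per_second = power_w / photon_energy_j   # part (b): N = P / E
print(f"E = {photon_energy_j:.3g} J, N = {photons_per_second:.2g} per second")
```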

Example 11.2 The work function of caesium is 2.14 eV. Find (a) the threshold frequency for
caesium, and (b) the wavelength of the incident light if the photocurrent is brought to zero by
a stopping potential of 0.60 V.

Solution

(a) For the cut-off or threshold frequency, the energy $h\nu_0$ of the incident radiation must be
equal to work function $\phi_0$, so that

$\nu_0 = \dfrac{\phi_0}{h} = \dfrac{2.14\ \text{eV}}{6.63 \times 10^{-34}\ \text{J s}} = \dfrac{2.14 \times 1.6 \times 10^{-19}\ \text{J}}{6.63 \times 10^{-34}\ \text{J s}} = 5.16 \times 10^{14}$ Hz

Thus, for frequencies less than this threshold frequency, no photoelectrons are ejected.

(b) Photocurrent reduces to zero when the maximum kinetic energy of the emitted
photoelectrons equals the potential energy $eV_0$ due to the retarding potential $V_0$. Einstein’s
photoelectric equation is

$eV_0 = h\nu - \phi_0 = \dfrac{hc}{\lambda} - \phi_0$

or, $\lambda = hc/(eV_0 + \phi_0)$

$= \dfrac{(6.63 \times 10^{-34}\ \text{J s}) \times (3 \times 10^{8}\ \text{m/s})}{(0.60\ \text{eV} + 2.14\ \text{eV})}$

$= \dfrac{19.89 \times 10^{-26}\ \text{J m}}{(2.74\ \text{eV})}$

$= \dfrac{19.89 \times 10^{-26}\ \text{J m}}{2.74 \times 1.6 \times 10^{-19}\ \text{J}}$

$= 454$ nm
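Both parts of Example 11.2 can be reproduced numerically. The sketch below uses the rounded constants of the example; the variable names are illustrative.

```python
H = 6.63e-34        # Planck's constant, J s
C = 3.0e8           # speed of light, m/s
EV_TO_J = 1.6e-19   # 1 eV in joules

work_function_ev = 2.14       # caesium
stopping_potential_v = 0.60   # volts

# (a) threshold frequency: h * nu_0 = phi_0
threshold_hz = work_function_ev * EV_TO_J / H

# (b) wavelength: h*c/lambda = e*V_0 + phi_0 (both terms expressed in eV, then converted)
wavelength_m = H * C / ((stopping_potential_v + work_function_ev) * EV_TO_J)
print(f"nu_0 = {threshold_hz:.3g} Hz, lambda = {wavelength_m * 1e9:.0f} nm")
```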


Example 11.3 The wavelength of light in the visible region is about 390 nm for violet colour,
about 550 nm (average wavelength) for yellow-green colour and about 760 nm for red colour.

(a) What are the energies of photons (in eV) at the (i) violet end, (ii) average wavelength,
yellow-green colour, and (iii) red end of the visible spectrum? (Take $h = 6.63 \times 10^{-34}$ J s and
1 eV = $1.6 \times 10^{-19}$ J.)

(b) From which of the photosensitive materials with work functions listed in Table 11.1 and
using the results of (i), (ii) and (iii) of (a), can you build a photoelectric device that operates
with visible light?

Solution

(a) Energy of the incident photon, $E = h\nu = hc/\lambda$
$E = (6.63 \times 10^{-34}\ \text{J s})(3 \times 10^{8}\ \text{m/s})/\lambda$
$= \dfrac{1.989 \times 10^{-25}\ \text{J m}}{\lambda}$

(i) For violet light, $\lambda_1$ = 390 nm (lower wavelength end)

Incident photon energy, $E_1 = \dfrac{1.989 \times 10^{-25}\ \text{J m}}{390 \times 10^{-9}\ \text{m}}$
$= 5.10 \times 10^{-19}$ J
$= \dfrac{5.10 \times 10^{-19}\ \text{J}}{1.6 \times 10^{-19}\ \text{J/eV}} = 3.19$ eV

(ii) For yellow-green light, $\lambda_2$ = 550 nm (average wavelength)

Incident photon energy, $E_2 = \dfrac{1.989 \times 10^{-25}\ \text{J m}}{550 \times 10^{-9}\ \text{m}}$
$= 3.62 \times 10^{-19}$ J = 2.26 eV

(iii) For red light, $\lambda_3$ = 760 nm (higher wavelength end)

Incident photon energy, $E_3 = \dfrac{1.989 \times 10^{-25}\ \text{J m}}{760 \times 10^{-9}\ \text{m}}$
$= 2.62 \times 10^{-19}$ J = 1.64 eV

(b) For a photoelectric device to operate, we require incident light energy $E$ to be equal to or
greater than the work function $\phi_0$ of the material. Thus, the photoelectric device will operate
with violet light (with $E$ = 3.19 eV) for the photosensitive materials Na (with $\phi_0$ = 2.75 eV),
K (with $\phi_0$ = 2.30 eV) and Cs (with $\phi_0$ = 2.14 eV). It will also operate with yellow-green light
(with $E$ = 2.26 eV) for Cs (with $\phi_0$ = 2.14 eV) only. However, it will not operate with red light
(with $E$ = 1.64 eV) for any of these photosensitive materials.
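The comparison made in part (b) can be sketched as a small lookup. Only the three work functions quoted in the solution (Na, K and Cs) are used here; the other entries of Table 11.1 are not reproduced. The dictionary and function names are illustrative.

```python
H = 6.63e-34        # Planck's constant, J s
C = 3.0e8           # speed of light, m/s
EV_TO_J = 1.6e-19   # 1 eV in joules

wavelengths_nm = {"violet": 390, "yellow-green": 550, "red": 760}
work_functions_ev = {"Na": 2.75, "K": 2.30, "Cs": 2.14}   # values quoted in the solution


def photon_energy_ev(wavelength_nm):
    """Photon energy in eV for a wavelength given in nanometres."""
    return H * C / (wavelength_nm * 1e-9) / EV_TO_J


# For each colour, list the metals whose work function the photon energy can overcome.
usable = {
    colour: [metal for metal, phi in work_functions_ev.items()
             if photon_energy_ev(nm) >= phi]
    for colour, nm in wavelengths_nm.items()
}
print(usable)
```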


11.8 Wave Nature of Matter 

The dual (wave-particle) nature of light (electromagnetic radiation, in 
general) comes out clearly from what we have learnt in this and the 
preceding chapters. The wave nature of light shows up in the phenomena 
of interference, diffraction and polarisation. On the other hand, in
photoelectric effect and Compton effect which involve energy and
momentum transfer, radiation behaves as if it is made up of a bunch of 
particles - the photons. Whether a particle or wave description is best 
suited for understanding an experiment depends on the nature of the 
experiment. For example, in the familiar phenomenon of seeing an object 
by our eye, both descriptions are important. The gathering and focussing 
mechanism of light by the eye-lens is well described in the wave picture. 
But its absorption by the rods and cones (of the retina) requires the 
photon picture of light. 

A natural question arises: If radiation has a dual (wave-particle) nature, 
might not the particles of nature (the electrons, protons, etc.) also exhibit 
wave-like character? In 1924, the French physicist Louis Victor de Broglie 
(pronounced as de Broy) (1892-1987) put forward the bold hypothesis 
that moving particles of matter should display wave-like properties under 
suitable conditions. He reasoned that nature was symmetrical and that 
the two basic physical entities - matter and energy, must have 
symmetrical character. If radiation shows dual aspects, so should matter. 
De Broglie proposed that the wavelength $\lambda$ associated with a particle of
momentum $p$ is given as

$\lambda = \dfrac{h}{p} = \dfrac{h}{m v}$ (11.5)

where $m$ is the mass of the particle and $v$ its speed. Equation (11.5) is
known as the de Broglie relation and the wavelength $\lambda$ of the matter wave
is called de Broglie wavelength. The dual aspect of matter is evident in
the de Broglie relation. On the left hand side of Eq. (11.5), $\lambda$ is the
attribute of a wave while on the right hand side the momentum $p$ is a
typical attribute of a particle. Planck’s constant $h$ relates the two
attributes.



Equation (11.5) for a material particle is basically a hypothesis whose 
validity can be tested only by experiment. However, it is interesting to see 
that it is satisfied also by a photon. For a photon, as we have seen,

$p = h\nu/c$ (11.6)

Therefore,

$\dfrac{h}{p} = \dfrac{c}{\nu} = \lambda$ (11.7)

That is, the de Broglie wavelength of a photon given by Eq. (11.5) equals
the wavelength of electromagnetic radiation of which the photon is a
quantum of energy and momentum.

Clearly, from Eq. (11.5), $\lambda$ is smaller for a heavier particle (large $m$) or
more energetic particle (large $v$). For example, the de Broglie wavelength
of a ball of mass 0.12 kg moving with a speed of 20 m s$^{-1}$ is easily
calculated:

Photocell 

A photocell is a technological application of the photoelectric effect. It is a 
device whose electrical properties are affected by light. It is also 
sometimes called an electric eye. A photocell consists of a semi- 
cylindrical photo-sensitive metal plate C (emitter) and a wire loop A 
(collector) supported in an evacuated glass or quartz bulb. It is connected 
to the external circuit having a high-tension battery B and microammeter 
(μA) as shown in the Figure. Sometimes, instead of the plate C, a thin
layer of photosensitive material is pasted on the inside of the bulb. A part 
of the bulb is left clean for the light to enter it. 


A photo cell 


When light of suitable wavelength falls on the emitter C, photoelectrons are emitted. These 
photoelectrons are drawn to the collector A. Photocurrent of the order of a few microampere 
can be normally obtained from a photo cell. 


A photocell converts a change in intensity of illumination into a change in photocurrent. This 
current can be used to operate control systems and in light measuring devices. A photocell of 
lead sulphide sensitive to infrared radiation is used in electronic ignition circuits. 




In scientific work, photo cells are used whenever it is necessary to measure the intensity of 
light. Light meters in photographic cameras make use of photo cells to measure the intensity 
of incident light. The photocells, inserted in the door light electric circuit, are used as 
automatic door opener. A person approaching a doorway may interrupt a light beam which is 
incident on a photocell. The abrupt change in photocurrent may be used to start a motor 
which opens the door or rings an alarm. They are used in the control of a counting device
which records every interruption of the light beam caused by a person or object passing
across the beam. So photocells help count the persons entering an auditorium, provided they 
enter the hall one by one. They are used for detection of traffic law defaulters: an alarm may 
be sounded whenever a beam of (invisible) radiation is intercepted. 

In burglar alarm, (invisible) ultraviolet light is continuously made to fall on a photocell installed 
at the doorway. A person entering the door interrupts the beam falling on the photocell. The 
abrupt change in photocurrent is used to start an electric bell ringing. In fire alarm, a number 
of photocells are installed at suitable places in a building. In the event of breaking out of fire, 
light radiations fall upon the photocell. This completes the electric circuit through an electric 
bell or a siren which starts operating as a warning signal. 

Photocells are used in the reproduction of sound in motion pictures and in the television 
camera for scanning and telecasting scenes. They are used in industries for detecting minor 
flaws or holes in metal sheets. 


$p = m v = 0.12\ \text{kg} \times 20\ \text{m s}^{-1} = 2.40\ \text{kg m s}^{-1}$

$\lambda = \dfrac{h}{p} = \dfrac{6.63 \times 10^{-34}\ \text{J s}}{2.40\ \text{kg m s}^{-1}} = 2.76 \times 10^{-34}$ m


This wavelength is so small that it is beyond any measurement. This is
the reason why macroscopic objects in our daily life do not show wave-like
properties. On the other hand, in the sub-atomic domain, the wave
character of particles is significant and measurable.

Consider an electron (mass $m$, charge $e$) accelerated from rest through a
potential $V$. The kinetic energy $K$
of the electron equals the work done ($eV$) on it by the electric field:

$K = eV$ (11.8)

Now, $K = \dfrac{1}{2} m v^2 = \dfrac{p^2}{2m}$, so that

$p = \sqrt{2 m K} = \sqrt{2 m e V}$ (11.9)

The de Broglie wavelength $\lambda$ of the electron is then

$\lambda = \dfrac{h}{p} = \dfrac{h}{\sqrt{2 m K}} = \dfrac{h}{\sqrt{2 m e V}}$ (11.10)

Substituting the numerical values of $h$, $m$, $e$, we get

$\lambda = \dfrac{1.227}{\sqrt{V}}$ nm (11.11)

where V is the magnitude of accelerating potential in volts. For a 120 V 
accelerating potential, Eq. (11.11) gives $\lambda$ = 0.112 nm. This wavelength is
of the same order as the spacing between the atomic planes in crystals. 
This suggests that matter waves associated with an electron could be 
verified by crystal diffraction experiments analogous to X-ray diffraction. 
We describe the experimental verification of the de Broglie hypothesis in 
the next section. In 1929, de Broglie was awarded the Nobel Prize in 
Physics for his discovery of the wave nature of electrons. 
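The numerical coefficient in Eq. (11.11) can be sketched directly from the momentum of an electron accelerated through a potential $V$. The constants below are the rounded values used in this chapter's examples, so the coefficient comes out near 1.23 rather than exactly 1.227, which corresponds to more precise constants. The function name is illustrative.

```python
import math

H = 6.63e-34         # Planck's constant, J s
M_E = 9.11e-31       # electron mass, kg
E_CHARGE = 1.6e-19   # electron charge, C


def electron_wavelength_nm(volts):
    """de Broglie wavelength (nm) of an electron accelerated from rest through `volts`."""
    momentum = math.sqrt(2.0 * M_E * E_CHARGE * volts)   # p = sqrt(2 m e V)
    return H / momentum * 1e9                            # lambda = h / p, converted to nm


coefficient = electron_wavelength_nm(1.0)   # numerator of Eq. (11.11)
lam_120 = electron_wavelength_nm(120.0)     # the 120 V case discussed in the text
print(f"coefficient = {coefficient:.3f} nm, lambda(120 V) = {lam_120:.3f} nm")
```

This non-relativistic expression is adequate only while $eV$ is small compared with the rest energy of the electron.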









Louis Victor de Broglie (1892 - 1987) French physicist who put forth the revolutionary idea of
wave nature of matter. This idea was developed by Erwin Schrodinger into a full-fledged 
theory of quantum mechanics commonly known as wave mechanics. In 1929, he was 
awarded the Nobel Prize in Physics for his discovery of the wave nature of electrons. 


The matter-wave picture elegantly incorporates Heisenberg’s
uncertainty principle. According to the principle, it is not possible to
measure both the position and momentum of an electron (or any other
particle) at the same time exactly. There is always some uncertainty ($\Delta x$)
in the specification of position and some uncertainty ($\Delta p$) in the
specification of momentum. The product of $\Delta x$ and $\Delta p$ is of the order of $\hbar$*
(with $\hbar = h/2\pi$), i.e.,

$\Delta x \, \Delta p \approx \hbar$ (11.12)

Equation (11.12) allows the possibility that $\Delta x$ is zero; but then $\Delta p$ must
be infinite in order that the product is non-zero. Similarly, if $\Delta p$ is zero, $\Delta x$
must be infinite. Ordinarily, both $\Delta x$ and $\Delta p$ are non-zero such that their
product is of the order of $\hbar$.
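To get a feel for the magnitudes involved, the order-of-magnitude relation of Eq. (11.12) can be sketched for an electron. The confinement length of $10^{-10}$ m (roughly an atomic size) is an assumption chosen for illustration, and the function name is hypothetical.

```python
import math

H = 6.63e-34                 # Planck's constant, J s
HBAR = H / (2.0 * math.pi)   # reduced Planck's constant, J s
M_E = 9.11e-31               # electron mass, kg


def momentum_spread(delta_x_m):
    """Order-of-magnitude momentum uncertainty from delta_x * delta_p ~ hbar."""
    return HBAR / delta_x_m


delta_p = momentum_spread(1.0e-10)   # assumed confinement of about one atomic size
delta_v = delta_p / M_E              # corresponding spread in the electron's speed
print(f"delta_p ~ {delta_p:.2g} kg m/s, delta_v ~ {delta_v:.2g} m/s")
```

For an electron confined to atomic dimensions the resulting spread in speed is of the order of $10^{6}$ m/s, which is why the uncertainty principle matters in the sub-atomic domain.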

Now, if an electron has a definite momentum $p$ (i.e., $\Delta p = 0$), by the de
Broglie relation, it has a definite wavelength $\lambda$. A wave of definite (single)
wavelength extends all over space. By Born’s probability interpretation
this means that the electron is not localised in any finite region of space.
That is, its position uncertainty is infinite ($\Delta x \to \infty$), which is consistent
with the uncertainty principle.

* A more rigorous treatment gives $\Delta x \, \Delta p \geq \hbar/2$.

In general, the matter wave associated with the electron is not extended
all over space. It is a wave packet extending over some finite region of
space. In that case $\Delta x$ is not infinite but has some finite value depending
on the extension of the wave packet. Also, you must appreciate that a
wave packet of finite extension does not have a single wavelength. It is
built up of wavelengths spread around some central wavelength.

By de Broglie’s relation, then, the momentum of the electron will also
have a spread - an uncertainty $\Delta p$. This is as expected from the
uncertainty principle. It can be shown that the wave packet description
together with de Broglie relation and Born’s probability interpretation
reproduces Heisenberg’s uncertainty principle exactly.

Figure 11.6 (a) The wave packet description of an electron. The wave packet corresponds to a
spread of wavelength around some central wavelength (and hence by de Broglie relation, a
spread in momentum). Consequently, it is associated with an uncertainty in position ($\Delta x$) and an
uncertainty in momentum ($\Delta p$). (b) The matter wave corresponding to a definite momentum of an
electron extends all over space. In this case, $\Delta p = 0$ and $\Delta x \to \infty$.

In Chapter 12, the de Broglie relation will be seen to justify Bohr’s 
postulate on quantisation of angular momentum of electron in an atom. 

Figure 11.6 shows a schematic diagram of (a) a localised wave packet, 
and (b) an extended wave with fixed wavelength. 


Example 11.4 What is the de Broglie wavelength associated with (a) an electron moving with
a speed of $5.4 \times 10^{6}$ m/s, and (b) a ball of mass 150 g travelling at 30.0 m/s?

Solution

(a) For the electron:

Mass $m = 9.11 \times 10^{-31}$ kg, speed $v = 5.4 \times 10^{6}$ m/s. Then, momentum

$p = m v = 9.11 \times 10^{-31}\ \text{kg} \times 5.4 \times 10^{6}\ \text{m/s}$

$p = 4.92 \times 10^{-24}$ kg m/s

de Broglie wavelength, $\lambda = h/p$

$\lambda = \dfrac{6.63 \times 10^{-34}\ \text{J s}}{4.92 \times 10^{-24}\ \text{kg m/s}}$

$\lambda = 0.135$ nm

(b) For the ball:

Mass $m' = 0.150$ kg, speed $v' = 30.0$ m/s.

Then momentum $p' = m' v' = 0.150\ \text{kg} \times 30.0\ \text{m/s}$

$p' = 4.50$ kg m/s

de Broglie wavelength $\lambda' = h/p'$

$\lambda' = \dfrac{6.63 \times 10^{-34}\ \text{J s}}{4.50\ \text{kg m/s}}$

$\lambda' = 1.47 \times 10^{-34}$ m

The de Broglie wavelength of the electron is comparable with X-ray wavelengths. However, for
the ball it is about $10^{-19}$ times the size of the proton, quite beyond experimental
measurement.
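The arithmetic of Example 11.4 can be checked with a short script. This is a minimal sketch, assuming the same rounded constants used in the example; the function name `de_broglie_wavelength` is a hypothetical helper introduced only for illustration, and the formula is the non-relativistic $\lambda = h/(mv)$.

```python
# Planck's constant as rounded in the worked example (J s).
H = 6.63e-34

def de_broglie_wavelength(mass_kg, speed_m_per_s):
    """Return the non-relativistic de Broglie wavelength h/(m v) in metres."""
    momentum = mass_kg * speed_m_per_s
    return H / momentum

electron_wavelength = de_broglie_wavelength(9.11e-31, 5.4e6)  # part (a)
ball_wavelength = de_broglie_wavelength(0.150, 30.0)          # part (b)

print(f"electron: {electron_wavelength * 1e9:.3f} nm")  # electron: 0.135 nm
print(f"ball:     {ball_wavelength:.3g} m")             # ball:     1.47e-34 m
```

The two results differ by about 24 orders of magnitude, which is why wave effects are observable for electrons but not for everyday objects.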

Example 11.5 An electron, an α-particle, and a proton have the same kinetic energy. Which
of these particles has the shortest de Broglie wavelength?

Solution

For a particle, de Broglie wavelength, $\lambda = h/p$
Kinetic energy, $K = p^{2}/2m$
Then, $\lambda = h/\sqrt{2mK}$

For the same kinetic energy K, the de Broglie wavelength associated with a particle is
inversely proportional to the square root of its mass. A proton ($^{1}_{1}\text{H}$) is 1836 times
as massive as an electron, and an α-particle ($^{4}_{2}\text{He}$) is four times as massive as a proton.

Hence, the α-particle has the shortest de Broglie wavelength.
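The scaling argument of Example 11.5 can also be checked numerically. The sketch below assumes a common kinetic energy of 100 eV, which is an arbitrary illustrative choice (the example does not specify a value), and uses the mass ratios quoted in the solution; the helper name `wavelength_from_kinetic_energy` is hypothetical.

```python
import math

H = 6.63e-34                   # Planck's constant (J s)
M_ELECTRON = 9.11e-31          # electron mass (kg)
M_PROTON = 1836 * M_ELECTRON   # mass ratio quoted in the example
M_ALPHA = 4 * M_PROTON         # mass ratio quoted in the example

def wavelength_from_kinetic_energy(mass_kg, kinetic_energy_j):
    """Return h / sqrt(2 m K), valid for non-relativistic particles."""
    return H / math.sqrt(2.0 * mass_kg * kinetic_energy_j)

K = 100 * 1.602e-19  # assumed common kinetic energy: 100 eV in joules

wavelengths = {
    "electron": wavelength_from_kinetic_energy(M_ELECTRON, K),
    "proton": wavelength_from_kinetic_energy(M_PROTON, K),
    "alpha": wavelength_from_kinetic_energy(M_ALPHA, K),
}

# The particle with the largest mass has the shortest wavelength.
shortest = min(wavelengths, key=wavelengths.get)
print(shortest)  # alpha
```

Whatever value of K is chosen, the ordering is the same, because K cancels when the wavelengths are compared.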


Probability interpretation to matter waves 

It is worth pausing here to reflect on just what a matter wave associated with a particle, say,
an electron, means. Actually, a truly satisfactory physical understanding of the dual nature of
matter and radiation has not emerged so far. The great founders of quantum
mechanics (Niels Bohr, Albert Einstein, and many others) struggled with this and related
concepts for long. Still the deep physical interpretation of quantum mechanics continues to be
an area of active research. Despite this, the concept of matter wave has been mathematically
introduced in modern quantum mechanics with great success. An important milestone in this
connection was when Max Born (1882-1970) suggested a probability interpretation to the
matter wave amplitude. According to this, the intensity (square of the amplitude) of the matter
wave at a point determines the probability density of the particle at that point. Probability
density means probability per unit volume. Thus, if A is the amplitude of the wave at a
point, $|A|^{2}\,\Delta V$ is the probability of the particle being found in a small volume $\Delta V$ around that
point. Thus, if the intensity of matter wave is large in a certain region, there is a greater
probability of the particle being found there than where the intensity is small.


Example 11.6 A particle is moving three times as fast as an electron. The ratio of the de
Broglie wavelength of the particle to that of the electron is $1.813 \times 10^{-4}$. Calculate the
particle’s mass and identify the particle.

Solution

de Broglie wavelength of a moving particle, having mass m and
velocity v:

$\lambda = \dfrac{h}{p} = \dfrac{h}{mv}$

Mass, $m = h/\lambda v$

For an electron, mass $m_e = h/\lambda_e v_e$

Now, we have $v/v_e = 3$ and $\lambda/\lambda_e = 1.813 \times 10^{-4}$

Then, mass of the particle,

$m = m_e \left(\dfrac{\lambda_e}{\lambda}\right)\left(\dfrac{v_e}{v}\right)$

$m = (9.11 \times 10^{-31}\ \text{kg}) \times (1/3) \times (1/1.813 \times 10^{-4})$

$m = 1.675 \times 10^{-27}$ kg.

Thus, the particle, with this mass, could be a proton or a neutron.
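The ratio method of Example 11.6 can be written as a one-line calculation. This is a minimal sketch using the numbers of the example; the function name `mass_from_ratios` is a hypothetical helper, and it simply rearranges $\lambda = h/(mv)$ so that Planck's constant cancels.

```python
M_ELECTRON = 9.11e-31  # electron mass (kg), as used in the example

def mass_from_ratios(speed_ratio, wavelength_ratio, reference_mass=M_ELECTRON):
    """Return m = m_ref / ((v / v_ref) * (lambda / lambda_ref))."""
    return reference_mass / (speed_ratio * wavelength_ratio)

# v / v_e = 3 and lambda / lambda_e = 1.813e-4, as given in the example.
particle_mass = mass_from_ratios(3.0, 1.813e-4)
print(f"{particle_mass:.3e} kg")  # 1.675e-27 kg
```

Because only ratios enter, the result does not depend on the value adopted for h.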

Example 11.7 What is the de Broglie wavelength associated with an electron, accelerated
through a potential difference of 100 volts?

Solution Accelerating potential V = 100 V. The de Broglie wavelength λ is

$\lambda = h/p = \dfrac{1.227}{\sqrt{V}}$ nm

$\lambda = \dfrac{1.227}{\sqrt{100}}$ nm = 0.123 nm

The de Broglie wavelength associated with an electron in this case is of the order of X-ray
wavelengths.
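The coefficient 1.227 in the formula used above follows from $\lambda = h/\sqrt{2 m e V}$ for an electron accelerated from rest. The sketch below recomputes the wavelength directly from the constants; the rounded values of h, m and e are assumptions chosen to match the precision of the text, and `electron_wavelength_nm` is a hypothetical helper name. The treatment is non-relativistic, so it should only be trusted for modest accelerating voltages.

```python
import math

H = 6.63e-34           # Planck's constant (J s), rounded
M_ELECTRON = 9.11e-31  # electron mass (kg), rounded
E_CHARGE = 1.602e-19   # elementary charge (C), rounded

def electron_wavelength_nm(voltage_v):
    """de Broglie wavelength (nm) of an electron accelerated through voltage_v."""
    momentum = math.sqrt(2.0 * M_ELECTRON * E_CHARGE * voltage_v)
    return H / momentum * 1e9

print(f"{electron_wavelength_nm(100.0):.3f} nm")  # 0.123 nm
```

Setting the voltage to 1 V returns the coefficient itself, close to 1.227 nm for these rounded constants.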


11.9 Davisson and Germer Experiment 

The wave nature of electrons was first experimentally verified by C.J. 
Davisson and L.H. Germer in 1927 and independently by G.P. Thomson, 
in 1928, who observed diffraction effects with beams of electrons 
scattered by crystals. Davisson and Thomson shared the Nobel Prize in 
1937 for their experimental discovery of diffraction of electrons by 
crystals. 









The experimental arrangement used by Davisson and Germer is
schematically shown in Fig. 11.7. It consists of an electron gun which
comprises a tungsten filament F, coated with barium oxide and heated
by a low voltage power supply (L.T. or battery). Electrons emitted by the
filament are accelerated to a desired velocity by applying suitable
potential/voltage from a high voltage power supply (H.T. or battery). They
are made to pass through a cylinder with fine holes along its axis,
producing a fine collimated beam. The beam is made to fall on the
surface of a nickel crystal. The electrons are scattered in all directions by
the atoms of the crystal. The intensity of the electron beam, scattered in a
given direction, is measured by the electron detector (collector). The
detector can be moved on a circular scale and is connected to a sensitive
galvanometer, which records the current. The deflection of the
galvanometer is proportional to the intensity of the electron beam
entering the collector. The apparatus is enclosed in an evacuated
chamber. By moving the detector on the circular scale at different
positions, the intensity of the scattered electron beam is measured for
different values of angle of scattering θ, which is the angle between the
incident and the scattered electron beams. The variation of the intensity
(I) of the scattered electrons with the angle of scattering θ is obtained for
different accelerating voltages.




The experiment was performed by varying the accelerating voltage from
44 V to 68 V. It was noticed that a strong peak appeared in the intensity
(I) of the scattered electrons for an accelerating voltage of 54 V at a
scattering angle θ = 50°.

The appearance of the peak in a particular direction is due to the 
constructive interference of electrons scattered from different layers of 
the regularly spaced atoms of the crystals. From the electron diffraction 
measurements, the wavelength of matter waves was found to be 0.165 
nm. 

The de Broglie wavelength λ associated with electrons, using Eq. (11.11),
for V = 54 V is given by

$\lambda = h/p = \dfrac{1.227}{\sqrt{V}}$ nm

$\lambda = \dfrac{1.227}{\sqrt{54}}$ nm = 0.167 nm

Thus, there is an excellent agreement between the theoretical value and
the experimentally obtained value of de Broglie wavelength. The
Davisson-Germer experiment thus strikingly confirms the wave nature of
electrons and the de Broglie relation. More recently, in 1989, the wave
nature of a beam of electrons was experimentally demonstrated in a
double-slit experiment, similar to that used for the wave nature of light.
Also, in an experiment in 1994, interference fringes were obtained with
beams of iodine molecules, which are about a million times more massive
than electrons.
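The agreement quoted above for the 54 V peak can be quantified with a short check. This is a minimal sketch that uses only the two numbers given in the text (the formula $\lambda = 1.227/\sqrt{V}$ nm and the measured 0.165 nm); the function name `predicted_wavelength_nm` is a hypothetical helper.

```python
import math

def predicted_wavelength_nm(voltage_v):
    """de Broglie wavelength (nm) from the formula 1.227 / sqrt(V)."""
    return 1.227 / math.sqrt(voltage_v)

measured_nm = 0.165                        # value from the diffraction data
predicted_nm = predicted_wavelength_nm(54.0)
relative_difference = abs(predicted_nm - measured_nm) / measured_nm

print(f"predicted: {predicted_nm:.3f} nm")                 # predicted: 0.167 nm
print(f"relative difference: {relative_difference:.1%}")   # relative difference: 1.2%
```

A difference of roughly one per cent is what the text describes as excellent agreement between theory and experiment.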

The de Broglie hypothesis has been basic to the development of modern 
quantum mechanics. It has also led to the field of electron optics. The 
wave properties of electrons have been utilised in the design of electron 
microscope which is a great improvement, with higher resolution, over 
the optical microscope. 


Summary 

1. The minimum energy needed by an electron to come out from a metal surface is called the
work function of the metal. Energy (greater than the work function $\phi_0$) required for electron
emission from the metal surface can be supplied by suitably heating it, applying a strong
electric field, or irradiating it with light of suitable frequency.

2. Photoelectric effect is the phenomenon of emission of electrons by metals when 
illuminated by light of suitable frequency. Certain metals respond to ultraviolet light while 
others are sensitive even to the visible light. 

Photoelectric effect involves conversion of light energy into electrical energy. It follows the law
of conservation of energy. The photoelectric emission is an instantaneous process and
possesses certain special features.


3. Photoelectric current depends on (i) the intensity of incident light, (ii) the potential 
difference applied between the two electrodes, and (iii) the nature of the emitter material. 

4. The stopping potential ($V_0$) depends on (i) the frequency of incident light, and (ii) the
nature of the emitter material. For a given frequency of incident light, it is independent of its
intensity. The stopping potential is directly related to the maximum kinetic energy of electrons
emitted:

$e V_0 = \tfrac{1}{2} m v_{\max}^{2} = K_{\max}$.

5. Below a certain frequency (threshold frequency) $\nu_0$, characteristic of the metal, no
photoelectric emission takes place, no matter how large the intensity may be.

6. The classical wave theory could not explain the main features of photoelectric effect. Its
picture of continuous absorption of energy from radiation could not explain the independence
of $K_{\max}$ on intensity, the existence of $\nu_0$ and the instantaneous nature of the process.

Einstein explained these features on the basis of photon picture of light. According to this,
light is composed of discrete packets of energy called quanta or photons. Each photon
carries an energy E (= hν) and momentum p (= h/λ), which depend on the frequency (ν) of
incident light and not on its intensity. Photoelectric emission from the metal surface occurs
due to absorption of a photon by an electron.

7. Einstein’s photoelectric equation is in accordance with the energy conservation law as
applied to the photon absorption by an electron in the metal. The maximum kinetic energy
$\tfrac{1}{2} m v_{\max}^{2}$ is equal to the photon energy (hν) minus the work function $\phi_0$ (= $h\nu_0$) of the
target metal:

$\tfrac{1}{2} m v_{\max}^{2} = V_0 e = h\nu - \phi_0 = h(\nu - \nu_0)$

This photoelectric equation explains all the features of the photoelectric effect. Millikan’s first
precise measurements confirmed Einstein’s photoelectric equation and yielded an
accurate value of Planck’s constant h. This led to the acceptance of particle or photon
description (nature) of electromagnetic radiation, introduced by Einstein.

8. Radiation has dual nature: wave and particle. The nature of experiment determines 
whether a wave or particle description is best suited for understanding the experimental 
result. Reasoning that radiation and matter should be symmetrical in nature, Louis Victor de 
Broglie attributed a wave-like character to matter (material particles). The waves associated 
with the moving material particles are called matter waves or de Broglie waves. 

9. The de Broglie wavelength (λ) associated with a moving particle is related to its momentum
p as: λ = h/p. The dualism of matter is inherent in the de Broglie relation which contains a
wave concept (λ) and a particle concept (p). The de Broglie wavelength is independent of the
charge and nature of the material particle. It is significantly measurable (of the order of the
atomic-planes spacing in crystals) only in case of sub-atomic particles like electrons, protons,
etc. (due to smallness of their masses and hence, momenta). However, it is indeed very
small, quite beyond measurement, in case of macroscopic objects, commonly encountered in
everyday life.

10. Electron diffraction experiments by Davisson and Germer, and by G. P. Thomson, as well
as many later experiments, have verified and confirmed the wave-nature of electrons. The de
Broglie hypothesis of matter waves supports Bohr’s concept of stationary orbits.

Points to Ponder

1. Free electrons in a metal are free in the sense that they move inside the metal in a 
constant potential (This is only an approximation). They are not free to move out of the metal. 
They need additional energy to get out of the metal. 

2. Free electrons in a metal do not all have the same energy. Like molecules in a gas jar, the 
electrons have a certain energy distribution at a given temperature. This distribution is 
different from the usual Maxwell’s distribution that you have learnt in the study of kinetic 
theory of gases. You will learn about it in later courses, but the difference has to do with the 
fact that electrons obey Pauli’s exclusion principle. 

3. Because of the energy distribution of free electrons in a metal, the energy required by an 
electron to come out of the metal is different for different electrons. Electrons with higher 
energy require less additional energy to come out of the metal than those with lower 
energies. Work function is the least energy required by an electron to come out of the metal. 

4. Observations on photoelectric effect imply that in the event of matter-light interaction,
absorption of energy takes place in discrete units of hν. This is not quite the same as saying
that light consists of particles, each of energy hν.









5. Observations on the stopping potential (its independence of intensity and dependence on 
frequency) are the crucial discriminator between the wave-picture and photon-picture of 
photoelectric effect. 

6. The wavelength of a matter wave given by $\lambda = h/p$ has physical significance; its phase
velocity $v_p$ has no physical significance. However, the group velocity of the matter wave is
physically meaningful and equals the velocity of the particle.



Exercises 

11.1 Find the 

(a) maximum frequency, and 

(b) minimum wavelength of X-rays produced by 30 kV electrons. 

11.2 The work function of caesium metal is 2.14 eV. When light of
frequency $6 \times 10^{14}$ Hz is incident on the metal surface, photoemission
of electrons occurs. What is the

(a) maximum kinetic energy of the emitted electrons,

(b) stopping potential, and

(c) maximum speed of the emitted photoelectrons?

11.3 The photoelectric cut-off voltage in a certain experiment is 1.5 V. 
What is the maximum kinetic energy of photoelectrons emitted? 

11.4 Monochromatic light of wavelength 632.8 nm is produced by a 
helium-neon laser. The power emitted is 9.42 mW. 

(a) Find the energy and momentum of each photon in the light beam, 







(b) How many photons per second, on the average, arrive at a target 
irradiated by this beam? (Assume the beam to have uniform cross- 
section which is less than the target area), and 

(c) How fast does a hydrogen atom have to travel in order to have the 
same momentum as that of the photon? 

11.5 The energy flux of sunlight reaching the surface of the earth is
$1.388 \times 10^{3}$ W/m$^{2}$. How many photons (nearly) per square metre are
incident on the Earth per second? Assume that the photons in the
sunlight have an average wavelength of 550 nm.

11.6 In an experiment on photoelectric effect, the slope of the cut-off
voltage versus frequency of incident light is found to be $4.12 \times 10^{-15}$
V s. Calculate the value of Planck’s constant.

11.7 A 100 W sodium lamp radiates energy uniformly in all directions.
The lamp is located at the centre of a large sphere that absorbs all 
the sodium light which is incident on it. The wavelength of the sodium 
light is 589 nm. (a) What is the energy per photon associated with the 
sodium light? (b) At what rate are the photons delivered to the 
sphere? 

11.8 The threshold frequency for a certain metal is $3.3 \times 10^{14}$ Hz. If
light of frequency $8.2 \times 10^{14}$ Hz is incident on the metal, predict the
cut-off voltage for the photoelectric emission.

11.9 The work function for a certain metal is 4.2 eV. Will this metal
give photoelectric emission for incident radiation of wavelength 330
nm?

11.10 Light of frequency $7.21 \times 10^{14}$ Hz is incident on a metal
surface. Electrons with a maximum speed of $6.0 \times 10^{5}$ m/s are
ejected from the surface. What is the threshold frequency for
photoemission of electrons?

11.11 Light of wavelength 488 nm is produced by an argon laser 
which is used in the photoelectric effect. When light from this spectral 
line is incident on the emitter, the stopping (cut-off) potential of 
photoelectrons is 0.38 V. Find the work function of the material from 
which the emitter is made. 

11.12 Calculate the 

(a) momentum, and 

(b) de Broglie wavelength of the electrons accelerated through a 
potential difference of 56 V. 

11.13 What is the

(a) momentum,

(b) speed, and

(c) de Broglie wavelength of an electron with kinetic energy of 120
eV?

11.14 The wavelength of light from the spectral emission line of 
sodium is 589 nm. Find the kinetic energy at which 

(a) an electron, and 

(b) a neutron, would have the same de Broglie wavelength. 

11.15 What is the de Broglie wavelength of

(a) a bullet of mass 0.040 kg travelling at the speed of 1.0 km/s,

(b) a ball of mass 0.060 kg moving at a speed of 1.0 m/s, and

(c) a dust particle of mass $1.0 \times 10^{-9}$ kg drifting with a speed of 2.2
m/s?


11.16 An electron and a photon each have a wavelength of 1.00 nm. 
Find 

(a) their momenta, 

(b) the energy of the photon, and 

(c) the kinetic energy of electron. 

11.17 (a) For what kinetic energy of a neutron will the associated de
Broglie wavelength be $1.40 \times 10^{-10}$ m?

(b) Also find the de Broglie wavelength of a neutron, in thermal 
equilibrium with matter, having an average kinetic energy of (3/2) k T 
at 300 K. 

11.18 Show that the wavelength of electromagnetic radiation is equal 
to the de Broglie wavelength of its quantum (photon). 

11.19 What is the de Broglie wavelength of a nitrogen molecule in air 
at 300 K? Assume that the molecule is moving with the root-mean- 
square speed of molecules at this temperature. (Atomic mass of 
nitrogen = 14.0076 u) 

Additional Exercises 

11.20 (a) Estimate the speed with which electrons emitted from a
heated emitter of an evacuated tube impinge on the collector
maintained at a potential difference of 500 V with respect to the
emitter. Ignore the small initial speeds of the electrons. The specific
charge of the electron, i.e., its e/m, is given to be $1.76 \times 10^{11}$ C kg$^{-1}$.

(b) Use the same formula you employ in (a) to obtain electron speed
for a collector potential of 10 MV. Do you see what is wrong? In
what way is the formula to be modified?

11.21 (a) A monoenergetic electron beam with electron speed of
$5.20 \times 10^{6}$ m s$^{-1}$ is subject to a magnetic field of $1.30 \times 10^{-4}$ T normal to
the beam velocity. What is the radius of the circle traced by the beam,
given e/m for electron equals $1.76 \times 10^{11}$ C kg$^{-1}$?

(b) Is the formula you employ in (a) valid for calculating radius of the
path of a 20 MeV electron beam? If not, in what way is it modified?

[Note: Exercises 11.20(b) and 11.21(b) take you to relativistic 
mechanics which is beyond the scope of this book. They have been 
inserted here simply to emphasise the point that the formulas you use 
in part (a) of the exercises are not valid at very high speeds or 
energies. See answers at the end to know what ‘very high speed or 
energy’ means.] 

11.22 An electron gun with its collector at a potential of 100 V fires
out electrons in a spherical bulb containing hydrogen gas at low
pressure (~$10^{-2}$ mm of Hg). A magnetic field of $2.83 \times 10^{-4}$ T curves
the path of the electrons in a circular orbit of radius 12.0 cm. (The
path can be viewed because the gas ions in the path focus the beam
by attracting electrons, and emitting light by electron capture; this
method is known as the ‘fine beam tube’ method.) Determine
e/m from the data.

11.23 (a) An X-ray tube produces a continuous spectrum of radiation
with its short wavelength end at 0.45 Å. What is the maximum energy
of a photon in the radiation?

(b) From your answer to (a), guess what order of accelerating voltage 
(for electrons) is required in such a tube? 

11.24 In an accelerator experiment on high-energy collisions of
electrons with positrons, a certain event is interpreted as annihilation
of an electron-positron pair of total energy 10.2 BeV into two γ-rays of
equal energy. What is the wavelength associated with each γ-ray?
(1 BeV = $10^{9}$ eV)

11.25 Estimating the following two numbers should be interesting. 
The first number will tell you why radio engineers do not need to 
worry much about photons! The second number tells you why our 
eye can never ‘count photons’, even in barely detectable light. 

(a) The number of photons emitted per second by a medium wave
transmitter of 10 kW power, emitting radio waves of wavelength 500
m.

(b) The number of photons entering the pupil of our eye per second
corresponding to the minimum intensity of white light that we humans
can perceive (~$10^{-10}$ W m$^{-2}$). Take the area of the pupil to be about
0.4 cm$^{2}$, and the average frequency of white light to be about
$6 \times 10^{14}$ Hz.

11.26 Ultraviolet light of wavelength 2271 Å from a 100 W mercury
source irradiates a photo-cell made of molybdenum metal. If the
stopping potential is -1.3 V, estimate the work function of the metal.
How would the photo-cell respond to a high intensity (~$10^{5}$ W m$^{-2}$)
red light of wavelength 6328 Å produced by a He-Ne laser?

11.27 Monochromatic radiation of wavelength 640.2 nm (1 nm = $10^{-9}$ m)
from a neon lamp irradiates photosensitive material made of
caesium on tungsten. The stopping voltage is measured to be 0.54 V.
The source is replaced by an iron source and its 427.2 nm line
irradiates the same photo-cell. Predict the new stopping voltage.

11.28 A mercury lamp is a convenient source for studying frequency
dependence of photoelectric emission, since it gives a number of
spectral lines ranging from the UV to the red end of the visible
spectrum. In our experiment with rubidium photo-cell, the following
lines from a mercury source were used:

$\lambda_1$ = 3650 Å, $\lambda_2$ = 4047 Å, $\lambda_3$ = 4358 Å, $\lambda_4$ = 5461 Å, $\lambda_5$ = 6907 Å

The stopping voltages, respectively, were measured to be:

$V_{01}$ = 1.28 V, $V_{02}$ = 0.95 V, $V_{03}$ = 0.74 V, $V_{04}$ = 0.16 V, $V_{05}$ = 0 V

Determine the value of Planck’s constant h, the threshold frequency
and work function for the material.

[Note: You will notice that to get h from the data, you will need to
know e (which you can take to be $1.6 \times 10^{-19}$ C). Experiments of this
kind on Na, Li, K, etc. were performed by Millikan, who, using his own
value of e (from the oil-drop experiment) confirmed Einstein’s
photoelectric equation and at the same time gave an independent
estimate of the value of h.]

11.29 The work function for the following metals is given: 

Na: 2.75 eV; K: 2.30 eV; Mo: 4.17 eV; Ni: 5.15 eV. Which of these
metals will not give photoelectric emission for a radiation of
wavelength 3300 Å from a He-Cd laser placed 1 m away from the
photocell? What happens if the laser is brought nearer and placed 50
cm away?

11.30 Light of intensity $10^{-5}$ W m$^{-2}$ falls on a sodium photo-cell of
surface area 2 cm$^{2}$. Assuming that the top 5 layers of sodium absorb
the incident energy, estimate time required for photoelectric emission
in the wave-picture of radiation. The work function for the metal is
given to be about 2 eV. What is the implication of your answer?




11.31 Crystal diffraction experiments can be performed using X-rays,
or electrons accelerated through appropriate voltage. Which probe
has greater energy? (For quantitative comparison, take the
wavelength of the probe equal to 1 Å, which is of the order of
interatomic spacing in the lattice) ($m_e = 9.11 \times 10^{-31}$ kg).

11.32 (a) Obtain the de Broglie wavelength of a neutron of kinetic
energy 150 eV. As you have seen in Exercise 11.31, an electron
beam of this energy is suitable for crystal diffraction experiments.
Would a neutron beam of the same energy be equally suitable?
Explain. ($m_n = 1.675 \times 10^{-27}$ kg)

(b) Obtain the de Broglie wavelength associated with thermal
neutrons at room temperature (27 °C). Hence explain why a fast
neutron beam needs to be thermalised with the environment before it
can be used for neutron diffraction experiments.

11.33 An electron microscope uses electrons accelerated by a 
voltage of 50 kV. Determine the de Broglie wavelength associated 
with the electrons. If other factors (such as numerical aperture, etc.) 
are taken to be roughly the same, how does the resolving power of 
an electron microscope compare with that of an optical microscope 
which uses yellow light? 

11.34 The wavelength of a probe is roughly a measure of the size of
a structure that it can probe in some detail. The quark structure of
protons and neutrons appears at the minute length-scale of $10^{-15}$ m
or less. This structure was first probed in the early 1970s using high
energy electron beams produced by a linear accelerator at Stanford,
USA. Guess what might have been the order of energy of these
electron beams. (Rest mass energy of electron = 0.511 MeV.)





11.35 Find the typical de Broglie wavelength associated with a He 
atom in helium gas at room temperature (27 °C) and 1 atm pressure; 
and compare it with the mean separation between two atoms under 
these conditions. 

11.36 Compute the typical de Broglie wavelength of an electron in a
metal at 27 °C and compare it with the mean separation between two
electrons in a metal which is given to be about $2 \times 10^{-10}$ m.

[Note: Exercises 11.35 and 11.36 reveal that while the wave-packets
associated with gaseous molecules under ordinary conditions are
non-overlapping, the electron wave-packets in a metal strongly
overlap with one another. This suggests that whereas molecules in
an ordinary gas can be distinguished from one another, electrons in a
metal cannot. This indistinguishability has many fundamental
implications which you will explore in more advanced Physics courses.]

11.37 Answer the following questions: 

(a) Quarks inside protons and neutrons are thought to carry fractional 
charges [(+2/3)e ; (-1/3)e]. Why do they not show up in Millikan’s oil- 
drop experiment? 

(b) What is so special about the combination e/m? Why do we not 
simply talk of e and m separately? 

(c) Why should gases be insulators at ordinary pressures and start 
conducting at very low pressures? 

(d) Every metal has a definite work function. Why do all 
photoelectrons not come out with the same energy if incident 
radiation is monochromatic? Why is there an energy distribution of 
photoelectrons? 





(e) The energy and momentum of an electron are related to the
frequency and wavelength of the associated matter wave by the
relations:

$E = h\nu, \quad p = \dfrac{h}{\lambda}$

But while the value of λ is physically significant, the value of ν (and
therefore, the value of the phase speed νλ) has no physical
significance. Why?


Appendix 

11.1 The history of wave-particle flip-flop 

What is light? This question has haunted mankind for a long time. But 
systematic experiments were done by scientists since the dawn of the 
scientific and industrial era, about four centuries ago. Around the same 
time, theoretical models about what light is made of were developed. 
While building a model in any branch of science, it is essential to see that 
it is able to explain all the experimental observations existing at that time. 
It is therefore appropriate to summarize some observations about light 
that were known in the seventeenth century. 

The properties of light known at that time included (a) rectilinear 
propagation of light, (b) reflection from plane and curved surfaces, (c) 
refraction at the boundary of two media, (d) dispersion into various 
colours, (e) high speed. Appropriate laws were formulated for the first 
four phenomena. For example, Snell formulated his laws of refraction in 
1621. Several scientists right from the days of Galileo had tried to
measure the speed of light. But they had not been able to do so. They
had only concluded that it was higher than the limit of their measurement.

Two models of light were also proposed in the seventeenth century. 
Descartes, in early decades of seventeenth century, proposed that light 
consists of particles, while Huygens, around 1650-60, proposed that light 
consists of waves. Descartes' proposal was merely a philosophical 
model, devoid of any experiments or scientific arguments. Newton soon 
after, around 1660-70, extended Descartes' particle model, known as 
corpuscular theory, built it up as a scientific theory, and explained various 
known properties with it. These models, light as waves and as particles, 
in a sense, are quite opposite of each other. But both models could 
explain all the known properties of light. There was nothing to choose 
between them. 

The history of the development of these models over the next few 
centuries is interesting. Bartholinus, in 1669, discovered double refraction 
of light in some crystals, and Huygens, in 1678, was quick to explain it on 
the basis of his wave theory of light. In spite of this, for over one hundred 
years, Newton’s particle model was firmly believed and preferred over the 
wave model. This was partly because of its simplicity and partly because 
of Newton’s influence on contemporary physics. 

Then in 1801, Young performed his double-slit experiment and observed
interference fringes. This phenomenon could be explained only by wave
theory. It was realized that diffraction was another phenomenon
which could be explained only by wave theory. In fact, it was a natural
consequence of Huygens’ idea of secondary wavelets emanating from
every point in the path of light. These experiments could not be explained
by assuming that light consists of particles. Another phenomenon,
polarisation, was discovered around 1810, and this too could be naturally
explained by the wave theory. Thus the wave theory of Huygens came to the
forefront and Newton’s particle theory went into the background. This
situation again continued for almost a century.

Better experiments were performed in the nineteenth century to
determine the speed of light. With more accurate experiments, a value of
$3 \times 10^{8}$ m/s for speed of light in vacuum was arrived at. Around 1860,
Maxwell proposed his equations of electromagnetism and it was realized
that all electromagnetic phenomena known at that time could be
explained by Maxwell’s four equations. Soon Maxwell showed that
electric and magnetic fields could propagate through empty space
(vacuum) in the form of electromagnetic waves. He calculated the speed
of these waves and arrived at a theoretical value of $2.998 \times 10^{8}$ m/s. The
close agreement of this value with the experimental value suggested that
light consists of electromagnetic waves. In 1887 Hertz demonstrated the
generation and detection of such waves. This established the wave
theory of light on a firm footing. We might say that while the eighteenth
century belonged to the particle model, the nineteenth century belonged
to the wave model of light.

Vast amounts of experiments were done during the period 1850-1900 on 
heat and related phenomena, an altogether different area of physics. 
Theories and models like kinetic theory and thermodynamics were 
developed which quite successfully explained the various phenomena, 
except one. 

Every body at any temperature emits radiation of all wavelengths. It also 
absorbs radiation falling on it. A body which absorbs all the radiation 
falling on it is called a black body. It is an ideal concept in physics, like 
concepts of a point mass or uniform motion. A graph of the intensity of 
radiation emitted by a body versus wavelength is called the black body 
spectrum. No theory in those days could explain the complete black body 
spectrum! 



In 1900, Planck hit upon a novel idea. If we assume, he said, that 
radiation is emitted in packets of energy instead of continuously as in a 
wave, then we can explain the black body spectrum. Planck himself 
regarded these quanta, or packets, as a property of emission and 
absorption, rather than that of light. He derived a formula which agreed 
with the entire spectrum. This was a confusing mixture of wave and 
particle pictures - radiation is emitted as a particle, it travels as a wave, 
and is again absorbed as a particle! Moreover, this put physicists in a 
dilemma. Should we again accept the particle picture of light just to 
explain one phenomenon? Then what happens to the phenomena of 
interference and diffraction which cannot be explained by the particle 
model? 

But soon, in 1905, Einstein explained the photoelectric effect by assuming
the particle picture of light. In 1907, Debye explained the low temperature
specific heats of solids by using the particle picture for lattice vibrations in
a crystalline solid. Both these phenomena, belonging to widely diverse
areas of physics, could be explained only by the particle model and not by
the wave model. In 1923, Compton’s X-ray scattering experiments from
atoms also went in favour of the particle picture. This increased the
dilemma further.

Thus by 1923, physicists were faced with the following situation. (a) There
were some phenomena, like rectilinear propagation, reflection and
refraction, which could be explained by either the particle model or the
wave model. (b) There were some phenomena, such as diffraction and
interference, which could be explained only by the wave model and not by
the particle model. (c) There were some phenomena, such as black body
radiation, the photoelectric effect and Compton scattering, which could be
explained only by the particle model and not by the wave model. Somebody
in those days aptly remarked that light behaves as a particle on Mondays,
Wednesdays and Fridays, and as a wave on Tuesdays, Thursdays and
Saturdays, and we don’t talk of light on Sundays!

In 1924, de Broglie proposed his theory of wave-particle duality in which 
he said that not only photons of light but also ‘particles’ of matter such as 
electrons and atoms possess a dual character, sometimes behaving like 
a particle and sometimes as a wave. He gave a formula connecting their 
mass, velocity, momentum (particle characteristics), with their wavelength 
and frequency (wave characteristics)! In 1927 Thomson, and Davisson 
and Germer, in separate experiments, showed that electrons did behave 
like waves with a wavelength which agreed with that given by de 
Broglie’s formula. Their experiment was on diffraction of electrons 
through crystalline solids, in which the regular arrangement of atoms 
acted like a grating. Very soon, diffraction experiments with other
‘particles’ such as neutrons and protons were performed and these too
agreed with de Broglie’s formula. This confirmed wave-particle duality
as an established principle of physics. Here was a principle, physicists 
thought, which explained all the phenomena mentioned above not only 
for light but also for the so-called particles. 

But there was no basic theoretical foundation for wave-particle duality. De 
Broglie’s proposal was merely a qualitative argument based on symmetry 
of nature. Wave-particle duality was at best a principle, not an outcome of 
a sound fundamental theory. It is true that every experiment performed
agreed with the de Broglie formula. But physics does not work that way. On
the one hand, it needs experimental confirmation, while on the other
hand, it also needs a sound theoretical basis for the models proposed. This
was developed over the next two decades. Dirac developed his theory of
radiation in about 1928, and Heisenberg and Pauli gave it a firm footing
by 1930. Tomonaga, Schwinger, and Feynman, in the late 1940s, produced
further refinements and cleared the theory of the inconsistencies which
had been noticed. All these theories mainly put wave-particle duality on a
theoretical footing.

Although the story continues, it grows more and more complex and goes
beyond the scope of this note. But we have here the essential structure
of what happened, and let us be satisfied with it for the moment. Now it is
regarded as a natural consequence of present theories of physics that 
electromagnetic radiation as well as particles of matter exhibit both wave 
and particle properties in different experiments, and sometimes even in 
the different parts of the same experiment.

Chapter Twelve


Atoms 


12.1 Introduction 

By the nineteenth century, enough evidence had accumulated in favour
of the atomic hypothesis of matter. In 1897, the experiments on electric
discharge through gases carried out by the English physicist J. J.
Thomson (1856 - 1940) revealed that atoms of different elements
contain negatively charged constituents (electrons) that are identical for
all atoms. However, atoms as a whole are electrically neutral. Therefore,
an atom must also contain some positive charge to neutralise the 
negative charge of the electrons. But what is the arrangement of the 
positive charge and the electrons inside the atom? In other words, what 
is the structure of an atom? 

The first model of the atom was proposed by J. J. Thomson in 1898.
According to this model, the positive charge of the atom is uniformly
distributed throughout the volume of the atom and the negatively charged
electrons are embedded in it like seeds in a watermelon. This model was
picturesquely called the plum pudding model of the atom. However,
subsequent studies on atoms, as described in this chapter, showed that
the distribution of the electrons and positive charges is very different
from that proposed in this model.

We know that condensed matter (solids and liquids) and dense gases at 
all temperatures emit electromagnetic radiation in which a continuous 
distribution of several wavelengths is present, though with different 
intensities. This radiation is considered to be due to oscillations of atoms
and molecules, governed by the interaction of each atom or molecule
with its neighbours. In contrast, light emitted from rarefied gases heated
in a flame, or excited electrically in a glow tube such as the familiar neon
sign or mercury vapour light, has only certain discrete wavelengths. The
spectrum appears as a series of bright lines. In such gases, the average
spacing between atoms is large. Hence, the radiation emitted can be
considered to be due to individual atoms rather than to interactions
between atoms or molecules.


Ernest Rutherford (1871 - 1937)

Ernest Rutherford (1871 - 1937) New Zealand-born British physicist who did pioneering
work on radioactive radiation. He discovered alpha-rays and beta-rays. Along with Frederick
Soddy, he created the modern theory of radioactivity. He studied the ‘emanation’ of thorium
and discovered a new noble gas, an isotope of radon, now known as thoron. By scattering
alpha-rays from metal foils, he discovered the atomic nucleus and proposed the planetary
model of the atom. He also estimated the approximate size of the nucleus.


In the early nineteenth century it was also established that each element
is associated with a characteristic spectrum of radiation; for example,
hydrogen always gives a set of lines with fixed relative positions between
the lines. This fact suggested an intimate relationship between the
internal structure of an atom and the spectrum of radiation emitted by it.
In 1885, Johann Jakob Balmer (1825 - 1898) obtained a simple empirical
formula which gave the wavelengths of a group of lines emitted by atomic
hydrogen. Since hydrogen is the simplest of the elements known, we shall
consider its spectrum in detail in this chapter.

Ernest Rutherford (1871-1937), a former research student of J. J.
Thomson, was engaged in experiments on α-particles emitted by some
radioactive elements. In 1906, he proposed a classic experiment of
scattering of these α-particles by atoms to investigate the atomic
structure. This experiment was later performed around 1911 by Hans
Geiger (1882-1945) and Ernest Marsden (1889-1970), who was then a
20-year-old student and had not yet earned his bachelor’s degree. The
details are discussed in Section 12.2. The explanation of the results led to
the birth of Rutherford’s planetary model of the atom (also called the
nuclear model of the atom). According to this model, the entire positive
charge and most of the mass of the atom are concentrated in a small
volume called the nucleus, with electrons revolving around the nucleus
just as planets revolve around the sun.

Rutherford’s nuclear model was a major step towards how we see the 
atom today. However, it could not explain why atoms emit light of only 
discrete wavelengths. How could an atom as simple as hydrogen, 
consisting of a single electron and a single proton, emit a complex 
spectrum of specific wavelengths? In the classical picture of an atom, the 
electron revolves round the nucleus much like the way a planet revolves 
round the sun. However, we shall see that there are some serious 
difficulties in accepting such a model. 

12.2 Alpha-particle Scattering and Rutherford’s Nuclear 
Model of Atom 


At the suggestion of Ernest Rutherford, in 1911, H. Geiger and E. Marsden
performed some experiments. In one of their experiments, as shown in
Fig. 12.1, they directed a beam of 5.5 MeV α-particles emitted from a
radioactive source at a thin metal foil made of gold. Figure 12.2
shows a schematic diagram of this experiment. Alpha-particles emitted
by a radioactive source were collimated into a narrow beam by
their passage through lead bricks. The beam was allowed to fall on a thin
foil of gold of thickness $2.1 \times 10^{-7}$ m. The scattered alpha-particles
were observed through a rotatable detector consisting of a zinc sulphide
screen and a microscope. The scattered alpha-particles on striking the
screen produced brief light flashes or scintillations. These flashes may be
viewed through a microscope and the distribution of the number of
scattered particles may be studied as a function of the angle of scattering.


Figure 12.1 Geiger-Marsden scattering experiment. The entire apparatus is placed in a vacuum
chamber (not shown in this figure).

Figure 12.2 Schematic arrangement of the Geiger-Marsden experiment.

A typical graph of the total number of α-particles scattered at different
angles, in a given interval of time, is shown in Fig. 12.3. The dots in this
figure represent the data points and the solid curve is the theoretical
prediction based on the assumption that the target atom has a small,
dense, positively charged nucleus. Many of the α-particles pass through
the foil. It means that they do not suffer any collisions. Only about 0.14%
of the incident α-particles scatter by more than 1°, and about 1 in 8000
deflect by more than 90°. Rutherford argued that, to deflect the α-particle
backwards, it must experience a large repulsive force. This force could
be provided if the greater part of the mass of the atom and its positive
charge were concentrated tightly at its centre. Then the incoming
α-particle could get very close to the positive charge without penetrating
it, and such a close encounter would result in a large deflection. The
agreement between the predicted curve and the data supported the
hypothesis of the nuclear atom. This is why Rutherford is credited with
the discovery of the nucleus.

In Rutherford’s nuclear model of the atom, the entire positive charge and
most of the mass of the atom are concentrated in the nucleus with the
electrons some distance away. The electrons would be moving in orbits
about the nucleus just as the planets do around the sun. Rutherford’s
experiments suggested the size of the nucleus to be about $10^{-15}$ m to
$10^{-14}$ m. From kinetic theory, the size of an atom was known to be
$10^{-10}$ m, about 10,000 to 100,000 times larger than the size of the
nucleus (see Chapter 11, Section 11.6 in Class XI Physics textbook). Thus,
the electrons would seem to be at a distance from the nucleus of about
10,000 to 100,000 times the size of the nucleus itself. Thus, most of an
atom is empty space. With the atom being largely empty space, it is easy
to see why most α-particles go right through a thin metal foil. However,
when an α-particle happens to come near a nucleus, the intense electric
field there scatters it through a large angle. The atomic electrons, being
so light, do not appreciably affect the α-particles.

Figure 12.3 Experimental data points (shown by dots) on scattering of α-particles by a thin foil at
different angles obtained by Geiger and Marsden using the setup shown in Figs. 12.1 and 12.2.
Rutherford’s nuclear model predicts the solid curve which is seen to be in good agreement with
experiment. The horizontal axis is the scattering angle θ (in degrees).


The scattering data shown in Fig. 12.3 can be analysed by employing
Rutherford’s nuclear model of the atom. As the gold foil is very thin, it can
be assumed that α-particles will suffer not more than one scattering
during their passage through it. Therefore, computation of the trajectory
of an alpha-particle scattered by a single nucleus is enough.
Alpha-particles are nuclei of helium atoms and, therefore, carry two units,
2e, of positive charge and have the mass of the helium atom. The charge
of the gold nucleus is Ze, where Z is the atomic number of the atom; for
gold Z = 79. Since the nucleus of gold is about 50 times heavier than an
α-particle, it is reasonable to assume that it remains stationary throughout
the scattering process. Under these assumptions, the trajectory of an
alpha-particle can be computed employing Newton’s second law of
motion and Coulomb’s law for the electrostatic force of repulsion between
the alpha-particle and the positively charged nucleus.


The magnitude of this force is

$$F = \frac{1}{4\pi\varepsilon_0}\,\frac{(2e)(Ze)}{r^2} \qquad (12.1)$$

where r is the distance between the α-particle and the nucleus. The force
is directed along the line joining the α-particle and the nucleus. The
magnitude and direction of the force on an α-particle change
continuously as it approaches the nucleus and recedes away from it.


12.2.1 Alpha-particle trajectory 

The trajectory traced by an α-particle depends on the impact parameter,
b, of the collision. The impact parameter is the perpendicular distance of
the initial velocity vector of the α-particle from the centre of the nucleus
(Fig. 12.4). A given beam of α-particles has a distribution of impact
parameters b, so that the beam is scattered in various directions with
different probabilities (Fig. 12.4). (In a beam, all particles have nearly the
same kinetic energy.) It is seen that an α-particle close to the nucleus
(small impact parameter) suffers large scattering. In the case of a head-on
collision, the impact parameter is minimum and the α-particle rebounds
back ($\theta \approx \pi$). For a large impact parameter, the α-particle goes
nearly undeviated and has a small deflection ($\theta \approx 0$).

The fact that only a small fraction of the number of incident particles
rebound back indicates that the number of α-particles undergoing head-on
collision is small. This, in turn, implies that the mass of the atom is
concentrated in a small volume. Rutherford scattering, therefore, is a
powerful way to determine an upper limit to the size of the nucleus.



Figure 12.4 Trajectory of α-particles in the Coulomb field of a target nucleus. The impact
parameter, b, and scattering angle θ are also depicted.


Example 12.1 In Rutherford’s nuclear model of the atom, the nucleus (radius about $10^{-15}$ m)
is analogous to the sun, about which the electron moves in an orbit (radius ≈ $10^{-10}$ m) like
the earth orbits around the sun. If the dimensions of the solar system had the same
proportions as those of the atom, would the earth be closer to or farther away from the sun
than it actually is? The radius of the earth’s orbit is about $1.5 \times 10^{11}$ m. The radius of
the sun is taken as $7 \times 10^{8}$ m.

Solution The ratio of the radius of the electron’s orbit to the radius of the nucleus is
($10^{-10}$ m)/($10^{-15}$ m) = $10^{5}$, that is, the radius of the electron’s orbit is $10^{5}$ times
larger than the radius of the nucleus. If the radius of the earth’s orbit around the sun were
$10^{5}$ times larger than the radius of the sun, the radius of the earth’s orbit would be
$10^{5} \times 7 \times 10^{8}$ m = $7 \times 10^{13}$ m. This is more than 100 times greater than the
actual orbital radius of the earth. Thus, the earth would be much farther away from the sun.

It implies that an atom contains a much greater fraction of empty space than our solar system
does.
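The arithmetic of Example 12.1 can be checked with a few lines of Python. This is only an illustrative sketch: the function name `scaled_orbit_radius` and the variable names are hypothetical choices made here, and the numbers are the rounded values quoted in the example.

```python
# Scale the solar system to atomic proportions (values from Example 12.1).
r_nucleus = 1e-15         # m, radius of the nucleus
r_electron_orbit = 1e-10  # m, radius of the electron's orbit
r_sun = 7e8               # m, radius of the sun
r_earth_orbit = 1.5e11    # m, actual radius of the earth's orbit


def scaled_orbit_radius(r_centre, orbit_to_centre_ratio):
    """Orbit radius if the orbit were `orbit_to_centre_ratio` times the radius of the central body."""
    return r_centre * orbit_to_centre_ratio


ratio = r_electron_orbit / r_nucleus        # orbit-to-nucleus ratio, about 1e5
scaled = scaled_orbit_radius(r_sun, ratio)  # earth's orbit in an "atomic" solar system

print(f"orbit / nucleus ratio : {ratio:.1e}")
print(f"scaled earth orbit    : {scaled:.1e} m")
print(f"scaled / actual orbit : {scaled / r_earth_orbit:.0f}")
```

The last ratio comes out at several hundred, in line with the statement that the scaled orbit is more than 100 times the actual one.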


Example 12.2 In a Geiger-Marsden experiment, what is the distance of closest approach to
the nucleus of a 7.7 MeV α-particle before it comes momentarily to rest and reverses its
direction?

Solution The key idea here is that throughout the scattering process, the total mechanical
energy of the system consisting of an α-particle and a gold nucleus is conserved. The
system’s initial mechanical energy is $E_i$, before the particle and nucleus interact, and it is
equal to its mechanical energy $E_f$ when the α-particle momentarily stops. The initial
energy $E_i$ is just the kinetic energy K of the incoming α-particle. The final energy $E_f$ is
just the electric potential energy U of the system. The potential energy U can be calculated
from Eq. (12.1).

Let d be the centre-to-centre distance between the α-particle and the gold nucleus when
the α-particle is at its stopping point. Then we can write the conservation of energy
$E_i = E_f$ as

$$K = \frac{1}{4\pi\varepsilon_0}\,\frac{(2e)(Ze)}{d} = \frac{2Ze^2}{4\pi\varepsilon_0 d}$$

Thus the distance of closest approach d is given by

$$d = \frac{2Ze^2}{4\pi\varepsilon_0 K}$$

The maximum kinetic energy found in α-particles of natural origin is 7.7 MeV or
$1.2 \times 10^{-12}$ J. Since $1/4\pi\varepsilon_0 = 9.0 \times 10^{9}$ N m$^2$/C$^2$, with
$e = 1.6 \times 10^{-19}$ C we have

$$d = \frac{(2)(9.0 \times 10^{9}\ \mathrm{N\,m^2/C^2})(1.6 \times 10^{-19}\ \mathrm{C})^2\,Z}{1.2 \times 10^{-12}\ \mathrm{J}} = 3.84 \times 10^{-16}\,Z\ \mathrm{m}$$

The atomic number of foil material gold is Z = 79, so that
d (Au) = $3.0 \times 10^{-14}$ m = 30 fm. (1 fm (i.e. fermi) = $10^{-15}$ m.)

The radius of gold nucleus is, therefore, less than $3.0 \times 10^{-14}$ m. This is not in very good
agreement with the observed result as the actual radius of gold nucleus is 6 fm. The cause of
discrepancy is that the distance of closest approach is considerably larger than the sum of
the radii of the gold nucleus and the α-particle. Thus, the α-particle reverses its motion
without ever actually touching the gold nucleus.
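As a rough numerical check of Example 12.2, the relation $d = 2Ze^2/(4\pi\varepsilon_0 K)$ can be evaluated directly. The sketch below uses the rounded constants from the example; the function name `closest_approach` is a hypothetical choice for illustration. Because it converts 7.7 MeV to joules without first rounding to $1.2 \times 10^{-12}$ J, the result differs slightly in the third significant figure but still rounds to about 30 fm.

```python
K_COULOMB = 9.0e9   # N m^2 / C^2, value of 1/(4*pi*eps0) as rounded in the text
E_CHARGE = 1.6e-19  # C, elementary charge as rounded in the text


def closest_approach(kinetic_energy_joule, atomic_number):
    """Head-on distance of closest approach d = 2 Z e^2 / (4 pi eps0 K), in metres."""
    return 2 * atomic_number * K_COULOMB * E_CHARGE**2 / kinetic_energy_joule


k_alpha = 7.7e6 * E_CHARGE                         # 7.7 MeV expressed in joules
d_gold = closest_approach(k_alpha, atomic_number=79)

print(f"K = {k_alpha:.2e} J")
print(f"d(Au) = {d_gold:.2e} m = {d_gold / 1e-15:.0f} fm")
```

Calling `closest_approach` with a smaller kinetic energy, such as the 5.5 MeV beam mentioned in Section 12.2, gives a larger distance, since d is inversely proportional to K.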


12.2.2 Electron orbits 

The Rutherford nuclear model of the atom, which involves classical
concepts, pictures the atom as an electrically neutral sphere consisting of
a very small, massive and positively charged nucleus at the centre
surrounded by the revolving electrons in their respective dynamically
stable orbits. The electrostatic force of attraction, $F_e$, between the
revolving electrons and the nucleus provides the requisite centripetal
force ($F_c$) to keep them in their orbits. Thus, for a dynamically stable orbit
in a hydrogen atom

$$F_e = F_c$$

$$\frac{1}{4\pi\varepsilon_0}\,\frac{e^2}{r^2} = \frac{mv^2}{r} \qquad (12.2)$$

Thus the relation between the orbit radius and the electron
velocity is

$$r = \frac{e^2}{4\pi\varepsilon_0 m v^2} \qquad (12.3)$$

The kinetic energy (K) and electrostatic potential energy (U) of the
electron in hydrogen atom are

$$K = \frac{1}{2}mv^2 = \frac{e^2}{8\pi\varepsilon_0 r} \quad\text{and}\quad U = -\frac{e^2}{4\pi\varepsilon_0 r}$$

(The negative sign in U signifies that the electrostatic force is in the -r
direction.) Thus the total energy E of the electron in a hydrogen atom is

$$E = K + U = \frac{e^2}{8\pi\varepsilon_0 r} - \frac{e^2}{4\pi\varepsilon_0 r} = -\frac{e^2}{8\pi\varepsilon_0 r} \qquad (12.4)$$

The total energy of the electron is negative. This implies the fact that the
electron is bound to the nucleus. If E were positive, an electron would not
follow a closed orbit around the nucleus.


Example 12.3 It is found experimentally that 13.6 eV energy is required to separate a
hydrogen atom into a proton and an electron. Compute the orbital radius and the velocity of
the electron in a hydrogen atom.

Solution Total energy of the electron in hydrogen atom is
-13.6 eV = $-13.6 \times 1.6 \times 10^{-19}$ J = $-2.2 \times 10^{-18}$ J. Thus from Eq. (12.4), we have

$$-\frac{e^2}{8\pi\varepsilon_0 r} = -2.2 \times 10^{-18}\ \mathrm{J}$$

This gives the orbital radius

$$r = -\frac{e^2}{8\pi\varepsilon_0 E} = -\frac{(9 \times 10^{9}\ \mathrm{N\,m^2/C^2})(1.6 \times 10^{-19}\ \mathrm{C})^2}{(2)(-2.2 \times 10^{-18}\ \mathrm{J})} = 5.3 \times 10^{-11}\ \mathrm{m}.$$

The velocity of the revolving electron can be computed from Eq. (12.3) with
$m = 9.1 \times 10^{-31}$ kg,

$$v = \frac{e}{\sqrt{4\pi\varepsilon_0 m r}} = 2.2 \times 10^{6}\ \mathrm{m/s}.$$
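The two steps of Example 12.3 (radius from the total energy via Eq. (12.4), then speed from Eq. (12.3)) can be sketched in Python as follows. The function names are hypothetical and the constants are the rounded values used in the example, so the output should be read as approximate.

```python
import math

K_COULOMB = 9.0e9     # N m^2 / C^2, value of 1/(4*pi*eps0) as rounded in the text
E_CHARGE = 1.6e-19    # C, elementary charge
M_ELECTRON = 9.1e-31  # kg, electron mass


def orbit_radius_from_energy(total_energy_joule):
    """r = -e^2 / (8 pi eps0 E), rearranged from Eq. (12.4); E must be negative (bound electron)."""
    return -K_COULOMB * E_CHARGE**2 / (2 * total_energy_joule)


def orbital_speed(radius_m):
    """v = e / sqrt(4 pi eps0 m r), rearranged from Eq. (12.3)."""
    return E_CHARGE * math.sqrt(K_COULOMB / (M_ELECTRON * radius_m))


e_total = -13.6 * E_CHARGE             # -13.6 eV expressed in joules
r = orbit_radius_from_energy(e_total)  # orbital radius in metres
v = orbital_speed(r)                   # orbital speed in m/s

print(f"r = {r:.2e} m")
print(f"v = {v:.2e} m/s")
```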


12.3 Atomic Spectra 

As mentioned in Section 12.1, each element has a characteristic
spectrum of radiation, which it emits. When an atomic gas or vapour is
excited at low pressure, usually by passing an electric current through it,
the emitted radiation has a spectrum which contains certain specific
wavelengths only. A spectrum of this kind is termed an emission line
spectrum and it consists of bright lines on a dark background. The
spectrum emitted by atomic hydrogen is shown in Fig. 12.5. Study of
the emission line spectrum of a material can therefore serve as a type of
“fingerprint” for identification of the gas. When white light passes through
a gas and we analyse the transmitted light using a spectrometer, we find
some dark lines in the spectrum. These dark lines correspond precisely
to those wavelengths which were found in the emission line spectrum of
the gas. This is called the absorption spectrum of the material of the gas.

Figure 12.5 Emission lines in the spectrum of hydrogen.


12.3.1 Spectral series 

We might expect that the frequencies of the light emitted by a particular
element would exhibit some regular pattern. Hydrogen is the simplest
atom and, therefore, has the simplest spectrum. In the observed
spectrum, however, at first sight, there does not seem to be any
semblance of order or regularity in the spectral lines. But the spacing
between lines within certain sets of the hydrogen spectrum decreases in
a regular way (Fig. 12.5). Each of these sets is called a spectral series. In
1885, the first such series was observed by a Swiss school teacher,
Johann Jakob Balmer (1825-1898), in the visible region of the hydrogen
spectrum. This series is called the Balmer series (Fig. 12.6). The line with
the longest wavelength, 656.3 nm in the red, is called Hα; the next line,
with wavelength 486.1 nm in the blue-green, is called Hβ; the third line,
434.1 nm in the violet, is called Hγ; and so on. As the wavelength
decreases, the lines appear closer together and are weaker in intensity.
Balmer found a simple empirical formula for the observed wavelengths

$$\frac{1}{\lambda} = R\left(\frac{1}{2^2} - \frac{1}{n^2}\right) \qquad (12.5)$$

where λ is the wavelength, R is a constant called the Rydberg constant,
and n may have integral values 3, 4, 5, etc. The value of R is
$1.097 \times 10^{7}$ m$^{-1}$. This equation is also called the Balmer formula.

Figure 12.6 Balmer series in the emission spectrum of hydrogen.


Taking n = 3 in Eq. (12.5), one obtains the wavelength of the Hα line:

$$\frac{1}{\lambda} = 1.097 \times 10^{7}\left(\frac{1}{2^2} - \frac{1}{3^2}\right)\ \mathrm{m^{-1}} = 1.524 \times 10^{6}\ \mathrm{m^{-1}}$$

i.e., λ = 656.3 nm

For n = 4, one obtains the wavelength of the Hβ line, etc. For n = ∞, one
obtains the limit of the series, at λ = 364.6 nm. This is the shortest
wavelength in the Balmer series. Beyond this limit, no further distinct
lines appear, instead only a faint continuous spectrum is seen.
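The way the Balmer lines crowd together towards the series limit can be seen by evaluating the Balmer formula for successive n. The following is a minimal sketch using the value of R quoted in the text; the function name `balmer_wavelength_nm` and the choice of n = 3 to 8 are illustrative assumptions.

```python
R_H = 1.097e7  # Rydberg constant in m^-1, as quoted in the text


def balmer_wavelength_nm(n):
    """Wavelength (in nm) of the Balmer line for integer n >= 3, from the Balmer formula."""
    if n < 3:
        raise ValueError("Balmer lines need n >= 3")
    inverse_wavelength = R_H * (1 / 2**2 - 1 / n**2)  # in m^-1
    return 1e9 / inverse_wavelength


lines = [balmer_wavelength_nm(n) for n in range(3, 9)]
gaps = [a - b for a, b in zip(lines, lines[1:])]  # spacing between neighbouring lines
series_limit_nm = 1e9 * 2**2 / R_H                # limit as n tends to infinity

for n, lam in zip(range(3, 9), lines):
    print(f"n = {n}: {lam:.1f} nm")
print(f"series limit: {series_limit_nm:.1f} nm")
```

The entries of `gaps` shrink steadily, which is the regular decrease in spacing described for a spectral series.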

Other series of spectra for hydrogen were subsequently discovered.
These are known, after their discoverers, as the Lyman, Paschen, Brackett,
and Pfund series. These are represented by the formulae:

Lyman series:

$$\frac{1}{\lambda} = R\left(\frac{1}{1^2} - \frac{1}{n^2}\right), \quad n = 2, 3, 4, \ldots \qquad (12.6)$$

Paschen series:

$$\frac{1}{\lambda} = R\left(\frac{1}{3^2} - \frac{1}{n^2}\right), \quad n = 4, 5, 6, \ldots \qquad (12.7)$$

Brackett series:

$$\frac{1}{\lambda} = R\left(\frac{1}{4^2} - \frac{1}{n^2}\right), \quad n = 5, 6, 7, \ldots \qquad (12.8)$$

Pfund series:

$$\frac{1}{\lambda} = R\left(\frac{1}{5^2} - \frac{1}{n^2}\right), \quad n = 6, 7, 8, \ldots \qquad (12.9)$$

The Lyman series is in the ultraviolet, and the Paschen and Brackett
series are in the infrared region.
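All five series share one pattern: a fixed integer in the first term and a running integer n in the second. A short sketch, assuming a hypothetical lookup table `SERIES` for the fixed integer of each series, makes the wavelength ranges easy to compare.

```python
R_H = 1.097e7  # Rydberg constant in m^-1, as quoted in the text

# Fixed integer in the first term of each series formula (hypothetical lookup table).
SERIES = {"Lyman": 1, "Balmer": 2, "Paschen": 3, "Brackett": 4, "Pfund": 5}


def series_wavelength_nm(series, n):
    """Wavelength (nm) of the line with running integer n in the named series."""
    n_fixed = SERIES[series]
    if n <= n_fixed:
        raise ValueError("n must exceed the fixed integer of the series")
    return 1e9 / (R_H * (1 / n_fixed**2 - 1 / n**2))


def series_range_nm(series):
    """(shortest, longest) wavelength of a series: the series limit and the first line."""
    n_fixed = SERIES[series]
    shortest = 1e9 * n_fixed**2 / R_H                     # n tends to infinity
    longest = series_wavelength_nm(series, n_fixed + 1)   # first line of the series
    return shortest, longest


for name in SERIES:
    low, high = series_range_nm(name)
    print(f"{name:8s}: {low:7.1f} nm to {high:7.1f} nm")
```

The Lyman range falls entirely below the visible band, while the Paschen and Brackett ranges lie above it, consistent with the statement about the ultraviolet and infrared regions.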

The Balmer formula Eq. (12.5) may be written in terms of frequency of
the light, recalling that

$$c = \nu\lambda \quad\text{or}\quad \frac{1}{\lambda} = \frac{\nu}{c}$$

Thus, Eq. (12.5) becomes

$$\nu = Rc\left(\frac{1}{2^2} - \frac{1}{n^2}\right) \qquad (12.10)$$



There are only a few elements (hydrogen, singly ionised helium, and
doubly ionised lithium) whose spectra can be represented by simple
formulae like Eqs. (12.5) - (12.9).

Equations (12.5) - (12.9) are useful as they give the wavelengths that
hydrogen atoms radiate or absorb. However, these results are empirical
and do not explain why only certain frequencies are observed
in the hydrogen spectrum.

12.4 Bohr Model of the Hydrogen Atom 

The model of the atom proposed by Rutherford assumes that the atom,
consisting of a central nucleus and revolving electron, is stable much like
the sun-planet system which the model imitates. However, there are some
fundamental differences between the two situations. While the planetary
system is held by gravitational force, the nucleus and electron, being
charged objects, interact by Coulomb’s law of force. We know that an
object which moves in a circle is being constantly accelerated, the
acceleration being centripetal in nature. According to classical
electromagnetic theory, an accelerating charged particle emits radiation
in the form of electromagnetic waves. The energy of an accelerating
electron should, therefore, continuously decrease. The electron would
spiral inward and eventually fall into the nucleus (Fig. 12.7). Thus, such
an atom cannot be stable. Further, according to the classical
electromagnetic theory, the frequency of the electromagnetic waves
emitted by the revolving electrons is equal to the frequency of revolution.
As the electrons spiral inwards, their angular velocities and hence their
frequencies would change continuously, and so will the frequency of the
light emitted. Thus, they would emit a continuous spectrum, in
contradiction to the line spectrum actually observed. Clearly the Rutherford
model tells only a part of the story, implying that the classical ideas are
not sufficient to explain the atomic structure.


Niels Henrik David Bohr (1885 - 1962)

Niels Henrik David Bohr (1885 - 1962) Danish physicist who explained the spectrum of
hydrogen atom based on quantum ideas. He gave a theory of nuclear fission based on the
liquid-drop model of nucleus. Bohr contributed to the clarification of conceptual problems in
quantum mechanics, in particular by proposing the complementarity principle.

Figure 12.7 An accelerated atomic electron must spiral into the nucleus as it loses energy.


Example 12.4 According to the classical electromagnetic theory, calculate the initial
frequency of the light emitted by the electron revolving around a proton in a hydrogen atom.

Solution From Example 12.3 we know that the velocity of the electron moving around a proton
in a hydrogen atom in an orbit of radius $5.3 \times 10^{-11}$ m is $2.2 \times 10^{6}$ m/s. Thus, the
frequency of the electron moving around the proton is

$$\nu = \frac{v}{2\pi r} = \frac{2.2 \times 10^{6}\ \mathrm{m\,s^{-1}}}{2\pi\,(5.3 \times 10^{-11}\ \mathrm{m})} \approx 6.6 \times 10^{15}\ \mathrm{Hz}.$$

According to the classical electromagnetic theory we know that the
frequency of the electromagnetic waves emitted by the revolving
electrons is equal to the frequency of its revolution around the
nucleus. Thus the initial frequency of the light emitted is
$6.6 \times 10^{15}$ Hz.
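The classical argument says not only that the electron radiates at its revolution frequency, but that this frequency keeps changing as the orbit shrinks. A small sketch, with hypothetical function names and the rounded constants used in the examples, reproduces the initial frequency and shows how it would rise for smaller radii if the classical relation between speed and radius, Eq. (12.3), continued to hold. The sample radii are arbitrary illustrative values.

```python
import math

K_COULOMB = 9.0e9     # N m^2 / C^2, value of 1/(4*pi*eps0) as rounded in the text
E_CHARGE = 1.6e-19    # C, elementary charge
M_ELECTRON = 9.1e-31  # kg, electron mass


def revolution_frequency(speed, radius):
    """Frequency of revolution nu = v / (2 pi r) for a circular orbit."""
    return speed / (2 * math.pi * radius)


def classical_frequency_at_radius(radius):
    """Revolution frequency when the speed follows the classical orbit relation, Eq. (12.3)."""
    speed = E_CHARGE * math.sqrt(K_COULOMB / (M_ELECTRON * radius))
    return revolution_frequency(speed, radius)


nu_initial = revolution_frequency(2.2e6, 5.3e-11)  # values from Example 12.4
print(f"initial frequency: {nu_initial:.2e} Hz")

# Arbitrary sample radii along the classical inward spiral
for r in (5.3e-11, 4.0e-11, 2.0e-11, 1.0e-11):
    print(f"r = {r:.1e} m -> nu = {classical_frequency_at_radius(r):.2e} Hz")
```

Since the frequency scales as $r^{-3/2}$ in this picture, it sweeps continuously upward as r decreases, which is why a continuous spectrum would be expected classically.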


It was Niels Bohr (1885 - 1962) who made certain modifications in this
model by adding the ideas of the newly developing quantum hypothesis.
Niels Bohr studied in Rutherford’s laboratory for several months in 1912
and he was convinced about the validity of the Rutherford nuclear model.
Faced with the dilemma discussed above, Bohr, in 1913, concluded
that in spite of the success of electromagnetic theory in explaining
large-scale phenomena, it could not be applied to the processes at the
atomic scale. It became clear that a fairly radical departure from the
established principles of classical mechanics and electromagnetism would
be needed to understand the structure of atoms and the relation of atomic
structure to atomic spectra. Bohr combined classical and early quantum
concepts and gave his theory in the form of three postulates. These are:

(i) Bohr’s first postulate was that an electron in an atom could revolve in
certain stable orbits without the emission of radiant energy, contrary to
the predictions of electromagnetic theory. According to this postulate,
each atom has certain definite stable states in which it can exist, and
each possible state has definite total energy. These are called the
stationary states of the atom.

(ii) Bohr’s second postulate defines these stable orbits. This postulate
states that the electron revolves around the nucleus only in those orbits
for which the angular momentum is some integral multiple of $h/2\pi$, where
h is Planck’s constant (= $6.6 \times 10^{-34}$ J s). Thus the angular
momentum (L) of the orbiting electron is quantised. That is

$$L = \frac{nh}{2\pi} \qquad (12.11)$$

(iii) Bohr’s third postulate incorporated into atomic theory the early
quantum concepts that had been developed by Planck and Einstein. It
states that an electron might make a transition from one of its specified
non-radiating orbits to another of lower energy. When it does so, a
photon is emitted having energy equal to the energy difference between
the initial and final states. The frequency of the emitted photon is then
given by

$$h\nu = E_i - E_f \qquad (12.12)$$

where $E_i$ and $E_f$ are the energies of the initial and final states and $E_i > E_f$.
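Bohr’s third postulate translates directly into a one-line calculation. The sketch below is illustrative only: the function name `photon_frequency` is hypothetical, Planck’s constant is the rounded value quoted in the second postulate, and the sample energies of -3.4 eV and -13.6 eV are assumed here as example inputs (they correspond to the two lowest hydrogen levels that follow from Eq. (12.19) later in this section).

```python
H_PLANCK = 6.6e-34  # J s, Planck's constant as rounded in the text
EV = 1.6e-19        # J per electron volt


def photon_frequency(e_initial_ev, e_final_ev):
    """Frequency (Hz) of the photon emitted in a transition, from h * nu = E_i - E_f."""
    if e_initial_ev <= e_final_ev:
        raise ValueError("emission requires E_i > E_f")
    return (e_initial_ev - e_final_ev) * EV / H_PLANCK


# Assumed example: transition between levels at -3.4 eV and -13.6 eV
nu = photon_frequency(-3.4, -13.6)
print(f"nu = {nu:.2e} Hz")
```

The guard on the energy ordering mirrors the condition $E_i > E_f$ stated with the postulate.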

For a hydrogen atom, Eq. (12.4) gives the expression to determine the
energies of different energy states. But then this equation requires the
radius r of the electron orbit. To calculate r, Bohr’s second postulate
about the angular momentum of the electron (the quantisation
condition) is used. The angular momentum L is given by

L = mvr

Bohr’s second postulate of quantisation [Eq. (12.11)] says that the
allowed values of angular momentum are integral multiples of $h/2\pi$.

$$L_n = m v_n r_n = \frac{nh}{2\pi} \qquad (12.13)$$

where n is an integer, $r_n$ is the radius of the nth possible orbit and $v_n$ is
the speed of the moving electron in the nth orbit. The allowed orbits are
numbered 1, 2, 3 ..., according to the values of n, which is called the
principal quantum number of the orbit.

From Eq. (12.3), the relation between $v_n$ and $r_n$ is

$$v_n = \frac{e}{\sqrt{4\pi\varepsilon_0 m r_n}}$$

Combining it with Eq. (12.13), we get the following expressions for $v_n$ and
$r_n$:

$$v_n = \frac{1}{n}\,\frac{e^2}{4\pi\varepsilon_0}\,\frac{1}{(h/2\pi)} \qquad (12.14)$$

and

$$r_n = \left(\frac{n^2}{m}\right)\left(\frac{h}{2\pi}\right)^2 \frac{4\pi\varepsilon_0}{e^2} \qquad (12.15)$$


Eq. (12.14) shows that the orbital speed in the nth orbit falls by a factor of
n relative to the innermost orbit. Using Eq. (12.15), the size of the
innermost orbit (n = 1) can be obtained as

$$r_1 = \frac{h^2 \varepsilon_0}{\pi m e^2}$$

This is called the Bohr radius, represented by the symbol $a_0$. Thus,

$$a_0 = \frac{h^2 \varepsilon_0}{\pi m e^2} \qquad (12.16)$$

Substitution of values of h, m, $\varepsilon_0$ and e gives $a_0 = 5.29 \times 10^{-11}$ m. From
Eq. (12.15), it can also be seen that the radii of the orbits increase as $n^2$.

The total energy of the electron in the stationary states of the hydrogen
atom can be obtained by substituting the value of orbital radius in
Eq. (12.4) as

$$E_n = -\left(\frac{e^2}{8\pi\varepsilon_0}\right)\left(\frac{m}{n^2}\right)\left(\frac{2\pi}{h}\right)^2\left(\frac{e^2}{4\pi\varepsilon_0}\right)$$

or

$$E_n = -\frac{m e^4}{8 n^2 \varepsilon_0^2 h^2} \qquad (12.17)$$

Substituting values, Eq. (12.17) yields

$$E_n = -\frac{2.18 \times 10^{-18}}{n^2}\ \mathrm{J} \qquad (12.18)$$


Atomic energies are often expressed in electron volts (eV) rather than
joules. Since 1 eV = $1.6 \times 10^{-19}$ J, Eq. (12.18) can be rewritten as

$$E_n = -\frac{13.6}{n^2}\ \mathrm{eV} \qquad (12.19)$$


The negative sign of the total energy of an electron moving in an orbit
means that the electron is bound with the nucleus. Energy will thus be
required to remove the electron from the hydrogen atom to a distance
infinitely far away from its nucleus (or proton in hydrogen atom).
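The Bohr-model results for the orbit radius, orbital speed and total energy can be tabulated numerically. The sketch below is an illustration under stated assumptions: the constants are standard values quoted to four significant figures (slightly more precise than the rounded values used elsewhere in the text), and the function names are hypothetical.

```python
import math

# Assumed constants (standard values to four significant figures)
H_PLANCK = 6.626e-34  # J s, Planck's constant
M_ELECTRON = 9.109e-31  # kg, electron mass
E_CHARGE = 1.602e-19  # C, elementary charge
EPS0 = 8.854e-12      # C^2 / (N m^2), permittivity of free space


def bohr_radius_n(n):
    """Radius of the nth orbit, r_n = n^2 h^2 eps0 / (pi m e^2), in metres."""
    return n**2 * H_PLANCK**2 * EPS0 / (math.pi * M_ELECTRON * E_CHARGE**2)


def bohr_speed_n(n):
    """Speed in the nth orbit, v_n = e^2 / (2 eps0 n h), in m/s."""
    return E_CHARGE**2 / (2 * EPS0 * n * H_PLANCK)


def bohr_energy_ev(n):
    """Total energy of the nth stationary state, E_n = -m e^4 / (8 n^2 eps0^2 h^2), in eV."""
    energy_joule = -M_ELECTRON * E_CHARGE**4 / (8 * n**2 * EPS0**2 * H_PLANCK**2)
    return energy_joule / E_CHARGE


for n in range(1, 5):
    print(f"n = {n}: r = {bohr_radius_n(n):.3e} m, "
          f"v = {bohr_speed_n(n):.3e} m/s, E = {bohr_energy_ev(n):.2f} eV")
```

The table shows the radius growing as $n^2$, the speed falling as 1/n and the energy rising towards zero as $1/n^2$; the product $m v_n r_n$ equals $nh/2\pi$ for every n, as the quantisation condition requires.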

The derivation of Eqs. (12.17) - (12.19) involves the assumption that the
electronic orbits are circular, though orbits under an inverse square force
are, in general, elliptical. (Planets move in elliptical orbits under the
inverse square gravitational force of the sun.) However, it was shown by
the German physicist Arnold Sommerfeld (1868 - 1951) that, when the
restriction of circular orbit is relaxed, these equations continue to hold
even for elliptic orbits.

Orbit vs state (orbital picture) of electron in atom 

We are introduced to the Bohr model of the atom at one time or another in the course of physics.
This model has its place in the history of quantum mechanics and particularly in explaining 
the structure of an atom. It has become a milestone since Bohr introduced the revolutionary 
idea of definite energy orbits for the electrons, contrary to the classical picture requiring an 
accelerating particle to radiate. Bohr also introduced the idea of quantisation of angular 
momentum of electrons moving in definite orbits. Thus it was a semi-classical picture of the 
structure of atom. 

Now with the development of quantum mechanics, we have a better understanding of the 
structure of atom. Solutions of the Schrodinger wave equation assign a wave-like description 
to the electrons bound in an atom due to attractive forces of the protons. 

An orbit of the electron in the Bohr model is the circular path of motion of an electron around 
the nucleus. But according to quantum mechanics, we cannot associate a definite path with 
the motion of the electrons in an atom. We can only talk about the probability of finding an 
electron in a certain region of space around the nucleus. This probability can be inferred from 
the one-electron wave function called the orbital. This function depends only on the 
coordinates of the electron. 

It is therefore essential that we understand the subtle differences that exist between the two models:

- The Bohr model is valid only for one-electron atoms/ions; an energy value, assigned to each
orbit, depends on the principal quantum number n in this model. We know that the energy
associated with a stationary state of an electron depends on n only for one-electron
atoms/ions. For a multi-electron atom/ion, this is not true.

- The solution of the Schrödinger wave equation, obtained for hydrogen-like atoms/ions,
called the wave function, gives information about the probability of finding an electron in
various regions around the nucleus. This orbital has no resemblance whatsoever with the
orbit defined for an electron in the Bohr model.


Example 12.5 A 10 kg satellite circles earth once every 2 h in an orbit having a radius of 
8000 km. Assuming that Bohr’s angular momentum postulate applies to satellites just as it 
does to an electron in the hydrogen atom, find the quantum number of the orbit of the 
satellite. 

Solution 

From Eq. (12.13), we have
$m v_n r_n = nh/2\pi$

Here $m = 10$ kg and $r_n = 8 \times 10^6$ m. We have the time period $T$ of the circling satellite as 2 h.
That is, $T = 7200$ s.

Thus the velocity $v_n = 2\pi r_n/T$.

The quantum number of the orbit of the satellite is
$n = (2\pi r_n)^2 \times m/(T \times h)$.

Substituting the values,

$n = (2\pi \times 8 \times 10^6\ \text{m})^2 \times 10/(7200\ \text{s} \times 6.64 \times 10^{-34}\ \text{J s})$

$= 5.3 \times 10^{45}$

Note that the quantum number for the satellite motion is extremely large! In fact for such large 
quantum numbers the results of quantisation conditions tend to those of classical physics. 
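
The arithmetic of Example 12.5 can be repeated in Python as a check. The sketch below uses the value of h quoted in the example; the function name `bohr_quantum_number` is a hypothetical helper introduced only for this illustration.

```python
import math

H = 6.64e-34  # Planck's constant as used in Example 12.5, J s


def bohr_quantum_number(mass, radius, period, h=H):
    """Quantum number n from m v r = n h / (2 pi) for a circular orbit."""
    speed = 2 * math.pi * radius / period      # v = 2 pi r / T
    angular_momentum = mass * speed * radius   # L = m v r
    return angular_momentum / (h / (2 * math.pi))


# Satellite of Example 12.5: 10 kg, 8000 km radius, 2 h period
n_satellite = bohr_quantum_number(mass=10.0, radius=8.0e6, period=7200.0)
print(n_satellite)
```

The same function can be applied to any circular orbit for which the mass, radius and period are known.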


12.4.1 Energy levels 

The energy of an atom is the least (largest negative value) when its
electron is revolving in an orbit closest to the nucleus, i.e., the one for
which $n = 1$. For $n = 2, 3, \ldots$ the absolute value of the energy $E$ is
smaller, hence the energy is progressively larger in the outer orbits. The
lowest state of the atom, called the ground state, is that of the lowest
energy, with the electron revolving in the orbit of smallest radius, the
Bohr radius, $a_0$. The energy of this state ($n = 1$), $E_1$, is -13.6 eV.
Therefore, the minimum energy required to free the electron from the ground
state of the hydrogen atom is 13.6 eV. It is called the ionisation energy of
the hydrogen atom. This prediction of Bohr’s model is in excellent
agreement with the experimental value of ionisation energy.

At room temperature, most of the hydrogen atoms are in the ground state.
When a hydrogen atom receives energy by processes such as electron
collisions, the atom may acquire sufficient energy to raise the electron to
higher energy states. The atom is then said to be in an excited state.
From Eq. (12.19), for $n = 2$, the energy $E_2$ is -3.40 eV. It means that the
energy required to excite an electron in the hydrogen atom to its first
excited state is $E_2 - E_1 = -3.40$ eV $- (-13.6)$ eV $= 10.2$ eV.
Similarly, $E_3 = -1.51$ eV and $E_3 - E_1 = 12.09$ eV; that is, to excite the
hydrogen atom from its ground state ($n = 1$) to the second excited state
($n = 3$), 12.09 eV of energy is required, and so on. From these excited
states the electron can then fall back to a state of lower energy, emitting a
photon in the process. Thus, as the excitation of the hydrogen atom
increases (that is, as $n$ increases), the value of the minimum energy
required to free the electron from the excited atom decreases.
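
The excitation and ionisation energies quoted above follow directly from Eq. (12.19). A minimal Python sketch, with hypothetical function names chosen for this illustration, is:

```python
def level_energy_ev(n):
    """Energy of the n-th level of hydrogen in eV, Eq. (12.19)."""
    return -13.6 / n**2


def excitation_energy_ev(n_low, n_high):
    """Energy needed to raise the electron from level n_low to n_high."""
    return level_energy_ev(n_high) - level_energy_ev(n_low)


def ionisation_energy_ev(n):
    """Minimum energy needed to free the electron from level n."""
    return 0.0 - level_energy_ev(n)


for n in (2, 3, 4):
    print(n, level_energy_ev(n), excitation_energy_ev(1, n), ionisation_energy_ev(n))
```

The last column decreases as n grows, which is the statement that a more highly excited atom needs less energy to be ionised.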
Figure 12.8 The energy level diagram for the hydrogen atom. The electron in a hydrogen atom at
room temperature spends most of its time in the ground state. To ionise a hydrogen atom with its
electron in the ground state, 13.6 eV of energy must be supplied. (The horizontal lines specify
the presence of allowed energy states.)


The energy level diagram for the stationary states of a hydrogen atom,
computed from Eq. (12.19), is given in Fig. 12.8. The principal quantum
number $n$ labels the stationary states in the ascending order of energy. In
this diagram, the highest energy state corresponds to $n = \infty$ in
Eq. (12.19) and has an energy of 0 eV. This is the energy of the atom when
the electron is completely removed ($r = \infty$) from the nucleus and is at
rest. Observe how the energies of the excited states come closer and closer
together as $n$ increases.


Franck - Hertz experiment 

The existence of discrete energy levels in an atom was directly verified in 1914 by James 
Franck and Gustav Hertz. They studied the spectrum of mercury vapour when electrons 
having different kinetic energies passed through the vapour. The electron energy was varied 
by subjecting the electrons to electric fields of varying strength. The electrons collide with the 
mercury atoms and can transfer energy to the mercury atoms. This can only happen when 
the energy of the electron is higher than the energy difference between an energy level of Hg 
occupied by an electron and a higher unoccupied level (see Figure). For instance, the 
difference between an occupied energy level of Hg and a higher unoccupied level is 4.9 eV. If 
an electron having an energy of 4.9 eV or more passes through mercury, an electron in a
mercury atom can absorb energy from the bombarding electron and get excited to the higher
level [Fig. (a)]. The colliding electron’s kinetic energy would reduce by this amount.

The excited electron would subsequently fall back to the ground state by emission of
radiation [Fig. (b)]. The wavelength of emitted radiation is:

$$\lambda = \frac{hc}{E} = \frac{6.625 \times 10^{-34} \times 3 \times 10^{8}}{4.9 \times 1.6 \times 10^{-19}} = 253\ \text{nm}$$

By direct measurement, Franck and Hertz found that the emission spectrum of mercury has a
line corresponding to this wavelength. For this experimental verification of Bohr’s basic ideas
of discrete energy levels in atoms and the process of photon emission, Franck and Hertz were
awarded the Nobel Prize in 1925.
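
The wavelength of the mercury line can be checked with a short Python sketch. It uses the rounded values of h, c and the electron volt that appear in the calculation above; the function name `photon_wavelength_nm` is a hypothetical helper.

```python
H = 6.625e-34  # Planck's constant as used in the text, J s
C = 3.0e8      # speed of light, m/s
EV = 1.6e-19   # one electron volt in joules


def photon_wavelength_nm(energy_ev):
    """Wavelength (in nm) of a photon carrying the given energy in eV."""
    wavelength_m = H * C / (energy_ev * EV)  # lambda = h c / E
    return wavelength_m * 1e9


mercury_line_nm = photon_wavelength_nm(4.9)
print(mercury_line_nm)
```

The result lies in the ultraviolet, which is why the line is detected with a spectrometer rather than seen by eye.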


12.5 The Line Spectra of the Hydrogen Atom 

According to the third postulate of Bohr’s model, when an atom makes a
transition from the higher energy state with quantum number $n_i$ to the
lower energy state with quantum number $n_f$ ($n_f < n_i$), the difference of
energy is carried away by a photon of frequency $\nu_{if}$ such that

$$h\nu_{if} = E_{n_i} - E_{n_f} \qquad (12.20)$$

Using Eq. (12.17) for $E_{n_f}$ and $E_{n_i}$, we get

$$h\nu_{if} = \frac{m e^4}{8 \varepsilon_0^2 h^2}\left(\frac{1}{n_f^2} - \frac{1}{n_i^2}\right) \qquad (12.21)$$

or

$$\nu_{if} = \frac{m e^4}{8 \varepsilon_0^2 h^3}\left(\frac{1}{n_f^2} - \frac{1}{n_i^2}\right) \qquad (12.22)$$

Equation (12.21) is the Rydberg formula for the spectrum of the
hydrogen atom. In this relation, if we take $n_f = 2$ and $n_i = 3, 4, 5, \ldots$,
it reduces to a form similar to Eq. (12.10) for the Balmer series. The
Rydberg constant $R$ is readily identified to be

$$R = \frac{m e^4}{8 \varepsilon_0^2 h^3 c} \qquad (12.23)$$

Inserting the values of the constants in Eq. (12.23) gives a value very
close to the value ($1.097 \times 10^7$ m$^{-1}$) obtained from the empirical
Balmer formula. This agreement between the theoretical and experimental
values of the Rydberg constant provided a direct and striking confirmation
of Bohr’s model.
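
The theoretical value of the Rydberg constant can be evaluated from $R = m e^4 / (8 \varepsilon_0^2 h^3 c)$, the form that follows from the Bohr energy levels. The Python sketch below does this; the rounded constant values are assumptions for illustration and are not the ones used in the text.

```python
# Rounded values of the constants (assumed here; not quoted in the text)
H = 6.626e-34         # Planck's constant, J s
M_E = 9.109e-31       # electron mass, kg
EPS0 = 8.854e-12      # permittivity of free space, C^2 N^-1 m^-2
E_CHARGE = 1.602e-19  # magnitude of the electron charge, C
C = 2.998e8           # speed of light, m/s

# Rydberg constant from the Bohr model, in m^-1
rydberg = M_E * E_CHARGE**4 / (8 * EPS0**2 * H**3 * C)

# Empirical value quoted in the text, in m^-1
rydberg_empirical = 1.097e7

print(rydberg, rydberg / rydberg_empirical)
```

The printed ratio shows how closely the model reproduces the empirical constant for the chosen rounding of the inputs.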

Since both $n_f$ and $n_i$ are integers, this immediately shows that in
transitions between different atomic levels, light is radiated in various
discrete frequencies. For the hydrogen spectrum, the Balmer formula
corresponds to $n_f = 2$ and $n_i = 3, 4, 5$, etc. The results of Bohr’s model
suggested the presence of other series spectra for the hydrogen atom: those
corresponding to transitions resulting from $n_f = 1$ and $n_i = 2, 3$, etc.;
$n_f = 3$ and $n_i = 4, 5$, etc., and so on. Such series were identified in the
course of spectroscopic investigations and are known as the Lyman, Balmer,
Paschen, Brackett, and Pfund series. The electronic transitions
corresponding to these series are shown in Fig. 12.9.

The various lines in the atomic spectra are produced when electrons
jump from a higher energy state to a lower energy state and photons are
emitted. These spectral lines are called emission lines. But when an atom
absorbs a photon that has precisely the same energy needed by the
electron in a lower energy state to make transitions to a higher energy
state, the process is called absorption. Thus if photons with a continuous
range of frequencies pass through a rarefied gas and then are analysed
with a spectrometer, a series of dark spectral absorption lines appear in
the continuous spectrum. The dark lines indicate the frequencies that
have been absorbed by the atoms of the gas.

The explanation of the hydrogen atom spectrum provided by Bohr’s 
model was a brilliant achievement, which greatly stimulated progress 
towards the modern quantum theory. In 1922, Bohr was awarded the Nobel
Prize in Physics.

Figure 12.9 Line spectra originate in transitions between energy levels.


Example 12.6 Using the Rydberg formula, calculate the wavelengths of the first four spectral 
lines in the Lyman series of the hydrogen spectrum. 

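Solution The Rydberg formula is

$$\frac{hc}{\lambda_{if}} = \frac{m e^4}{8 \varepsilon_0^2 h^2}\left(\frac{1}{n_f^2} - \frac{1}{n_i^2}\right)$$

The wavelengths of the first four lines in the Lyman series correspond to transitions from
$n_i = 2, 3, 4, 5$ to $n_f = 1$. We know that

$$\frac{m e^4}{8 \varepsilon_0^2 h^2} = 13.6\ \text{eV} = 21.76 \times 10^{-19}\ \text{J}$$

Therefore,

$$\lambda_{i1} = \frac{hc}{21.76 \times 10^{-19}\left(1 - \dfrac{1}{n_i^2}\right)}\ \text{m} = \frac{913.4\, n_i^2}{n_i^2 - 1}\ \text{Å}$$

Substituting $n_i = 2, 3, 4, 5$, we get $\lambda_{21} = 1218$ Å, $\lambda_{31} = 1028$ Å, $\lambda_{41} = 974.3$ Å, and $\lambda_{51} = 951.4$ Å.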
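
The four Lyman wavelengths can be recomputed with a short Python sketch. It assumes the rounded values $h = 6.625 \times 10^{-34}$ J s and $c = 3 \times 10^8$ m/s together with the 13.6 eV ground-state energy expressed in joules; the function name `lyman_wavelength_angstrom` is a hypothetical helper.

```python
H = 6.625e-34   # Planck's constant, J s (rounded value assumed here)
C = 3.0e8       # speed of light, m/s
E0 = 21.76e-19  # 13.6 eV expressed in joules, using 1 eV = 1.6e-19 J


def lyman_wavelength_angstrom(n_i):
    """Wavelength of the Lyman line for the transition n_i -> 1, in angstrom."""
    photon_energy = E0 * (1 - 1 / n_i**2)  # energy difference between levels
    wavelength_m = H * C / photon_energy   # lambda = h c / E
    return wavelength_m * 1e10


lyman_lines = {n_i: lyman_wavelength_angstrom(n_i) for n_i in (2, 3, 4, 5)}
for n_i, wavelength in lyman_lines.items():
    print(n_i, wavelength)
```

The wavelengths shrink towards a limit as $n_i$ grows, which is the series limit of the Lyman series.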


12.6 De Broglie’s Explanation of Bohr’s Second Postulate of Quantisation

Of all the postulates Bohr made in his model of the atom, perhaps the
most puzzling is his second postulate. It states that the angular
momentum of the electron orbiting around the nucleus is quantised (that
is, $L_n = nh/2\pi$; $n = 1, 2, 3, \ldots$). Why should the angular momentum
have only those values that are integral multiples of $h/2\pi$? The French
physicist Louis de Broglie explained this puzzle in 1923, ten years after
Bohr proposed his model.

We studied, in Chapter 11, de Broglie’s hypothesis that material
particles, such as electrons, also have a wave nature. C. J. Davisson and
L. H. Germer later experimentally verified the wave nature of electrons in
1927. Louis de Broglie argued that the electron in its circular orbit, as
proposed by Bohr, must be seen as a particle wave. In analogy to waves
travelling on a string, particle waves too can lead to standing waves
under resonant conditions. From Chapter 15 of Class XI Physics
textbook, we know that when a string is plucked, a vast number of
wavelengths are excited. However, only those wavelengths survive which
have nodes at the ends and form the standing wave in the string. It
means that in a string, standing waves are formed when the total
distance travelled by a wave down the string and back is one wavelength,
two wavelengths, or any integral number of wavelengths. Waves with
other wavelengths interfere with themselves upon reflection and their
amplitudes quickly drop to zero. For an electron moving in the $n$th circular
orbit of radius $r_n$, the total distance is the circumference of the orbit,
$2\pi r_n$. Thus

$$2\pi r_n = n\lambda, \quad n = 1, 2, 3, \ldots \qquad (12.24)$$

Figure 12.10 illustrates a standing particle wave on a circular orbit for
$n = 4$, i.e., $2\pi r_n = 4\lambda$, where $\lambda$ is the de Broglie wavelength
of the electron moving in the $n$th orbit. From Chapter 11, we have
$\lambda = h/p$, where $p$ is the magnitude of the electron’s momentum. If the
speed of the electron is much less than the speed of light, the momentum is
$mv_n$. Thus, $\lambda = h/mv_n$. From Eq. (12.24), we have

$2\pi r_n = nh/mv_n$ or $m v_n r_n = nh/2\pi$



This is the quantum condition proposed by Bohr for the angular
momentum of the electron [Eq. (12.13)]. In Section 12.4, we saw that this
equation is the basis of explaining the discrete orbits and energy levels in
the hydrogen atom. Thus de Broglie’s hypothesis provided an explanation for
Bohr’s second postulate for the quantisation of angular momentum of the
orbiting electron. The quantised electron orbits and energy states are due
to the wave nature of the electron, and only resonant standing waves can
persist.
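
The standing-wave condition can be verified numerically: for each Bohr orbit, the circumference divided by the de Broglie wavelength should equal the quantum number n. The Python sketch below assumes rounded constant values and uses the simplified forms $v_n = e^2/(2\varepsilon_0 h n)$ and $r_n = n^2 h^2 \varepsilon_0/(\pi m e^2)$ of Eqs. (12.14) and (12.15); the function names are hypothetical.

```python
import math

# Rounded values of the constants (assumed here; not quoted in the text)
H = 6.626e-34         # Planck's constant, J s
M_E = 9.109e-31       # electron mass, kg
EPS0 = 8.854e-12      # permittivity of free space, C^2 N^-1 m^-2
E_CHARGE = 1.602e-19  # magnitude of the electron charge, C


def orbit_speed(n):
    """Orbital speed in the n-th orbit, simplified form of Eq. (12.14)."""
    return E_CHARGE**2 / (2 * EPS0 * H * n)


def orbit_radius(n):
    """Radius of the n-th orbit, simplified form of Eq. (12.15)."""
    return n**2 * H**2 * EPS0 / (math.pi * M_E * E_CHARGE**2)


def wavelengths_per_orbit(n):
    """Number of de Broglie wavelengths that fit into the n-th orbit."""
    wavelength = H / (M_E * orbit_speed(n))  # lambda = h / (m v)
    return 2 * math.pi * orbit_radius(n) / wavelength


for n in range(1, 5):
    print(n, wavelengths_per_orbit(n))
```

Each printed value should equal n up to floating-point rounding, independent of the particular constant values chosen, because the constants cancel in the ratio.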

Bohr’s model, involving classical trajectory picture (planet-like electron 
orbiting the nucleus), correctly predicts the gross features of the 
hydrogenic atoms*, in particular, the frequencies of the radiation emitted 
or selectively absorbed. This model however has many limitations. 

Some are: 



Figure 12.10 A standing wave is shown on a circular orbit where four de Broglie wavelengths fit
into the circumference of the orbit.


(i) The Bohr model is applicable to hydrogenic atoms. It cannot be
extended even to mere two-electron atoms such as helium. The analysis
of atoms with more than one electron was attempted on the lines of
Bohr’s model for hydrogenic atoms but did not meet with any success.
The difficulty lies in the fact that each electron interacts not only with
the positively charged nucleus but also with all other electrons.

* Hydrogenic atoms are the atoms consisting of a nucleus with positive charge +Ze and a single
electron, where Z is the proton number. Examples are the hydrogen atom, singly ionised helium,
doubly ionised lithium, and so forth. In these atoms more complex electron-electron interactions
are nonexistent.

The formulation of Bohr model involves electrical force between positively 
charged nucleus and electron. It does not include the electrical forces 
between electrons which necessarily appear in multi-electron atoms. 

(ii) While Bohr’s model correctly predicts the frequencies of the light
emitted by hydrogenic atoms, the model is unable to explain the relative
intensities of the frequencies in the spectrum. In the emission spectrum of
hydrogen, some of the visible frequencies have weak intensity, others
strong. Why? Experimental observations show that some transitions are
more favoured than others. Bohr’s model is unable to account for the
intensity variations.

Bohr’s model presents an elegant picture of an atom but cannot be
generalised to complex atoms. For complex atoms we have to use a new
and radical theory based on quantum mechanics, which provides a more
complete picture of the atomic structure.


Laser light 

Imagine a crowded market place or a railway platform with people entering a gate and going
in all directions. Their footsteps are random and there is no phase correlation between
them. On the other hand, think of a large number of soldiers in a regulated march. Their
footsteps are very well correlated. See figure here.

This is similar to the difference between light emitted by an ordinary source like a candle or a 
bulb and that emitted by a laser. The acronym LASER stands for Light Amplification by 
Stimulated Emission of Radiation. Since its development in 1960, it has entered into all areas 
of science and technology. It has found applications in physics, chemistry, biology, medicine, 
surgery, engineering, etc. There are low power lasers, with a power of 0.5 mW, called pencil 
lasers, which serve as pointers. There are also lasers of different power, suitable for delicate 
surgery of eye or glands in the stomach. Finally, there are lasers which can cut or weld steel. 



(a) Light from a bulb 


(b) Laser light 

Light is emitted from a source in the form of packets of waves. Light coming out from an 
ordinary source contains a mixture of many wavelengths. There is also no phase relation 
between the various waves. Therefore, such light, even if it is passed through an aperture, 
spreads very fast and the beam size increases rapidly with distance. In the case of laser light, 
the wavelength of each packet is almost the same. Also the average length of the packet of 
waves is much larger. This means that there is better phase correlation over a longer duration 
of time. This results in reducing the divergence of a laser beam substantially. 

If there are $N$ atoms in a source, each emitting light with intensity $I$, then the total intensity
produced by an ordinary source is proportional to $NI$, whereas in a laser source, it is
proportional to $N^2 I$. Considering that $N$ is very large, we see that the light from a laser can be
much stronger than that from an ordinary source.


When astronauts of the Apollo missions visited the moon, they placed a mirror on its surface, 
facing the earth. Then scientists on the earth sent a strong laser beam, which was reflected 
by the mirror on the moon and received back on the earth. The size of the reflected laser 
beam and the time taken for the round trip were measured. This allowed a very accurate 
determination of (a) the extremely small divergence of a laser beam and (b) the distance of 
the moon from the earth. 


Summary 

1. An atom, as a whole, is electrically neutral and therefore contains equal amounts of positive
and negative charge.

2. In Thomson’s model, an atom is a spherical cloud of positive charges with electrons
embedded in it.

3. In Rutherford’s model, most of the mass of the atom and all its positive charge are
concentrated in a tiny nucleus (typically about one ten-thousandth the size of an atom), and the
electrons revolve around it.

4. Rutherford’s nuclear model has two main difficulties in explaining the structure of the atom: (a) It
predicts that atoms are unstable because the accelerated electrons revolving around the
nucleus must spiral into the nucleus. This contradicts the stability of matter. (b) It cannot
explain the characteristic line spectra of atoms of different elements.

5. Atoms of each element are stable and emit a characteristic spectrum. The spectrum consists
of a set of isolated parallel lines termed a line spectrum. It provides useful information about
the atomic structure.


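6. Atomic hydrogen emits a line spectrum consisting of various series. The frequency of
any line in a series can be expressed as a difference of two terms:

Lyman series: $\nu = Rc\left(\frac{1}{1^2} - \frac{1}{n^2}\right)$; $n = 2, 3, 4, \ldots$
Balmer series: $\nu = Rc\left(\frac{1}{2^2} - \frac{1}{n^2}\right)$; $n = 3, 4, 5, \ldots$
Paschen series: $\nu = Rc\left(\frac{1}{3^2} - \frac{1}{n^2}\right)$; $n = 4, 5, 6, \ldots$
Brackett series: $\nu = Rc\left(\frac{1}{4^2} - \frac{1}{n^2}\right)$; $n = 5, 6, 7, \ldots$
Pfund series: $\nu = Rc\left(\frac{1}{5^2} - \frac{1}{n^2}\right)$; $n = 6, 7, 8, \ldots$

7. To explain the line spectra emitted by atoms, as well as the stability of atoms, Niels Bohr
proposed a model for hydrogenic (single electron) atoms. He introduced three postulates and
laid the foundations of quantum mechanics: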


(a) In a hydrogen atom, an electron revolves in certain stable orbits (called stationary orbits) 
without the emission of radiant energy. 

(b) The stationary orbits are those for which the angular momentum is some integral multiple
of $h/2\pi$. (Bohr’s quantisation condition.) That is, $L = nh/2\pi$, where $n$ is an integer called a
quantum number.

(c) The third postulate states that an electron might make a transition from one of its specified
non-radiating orbits to another of lower energy. When it does so, a photon is emitted having
energy equal to the energy difference between the initial and final states. The frequency ($\nu$) of
the emitted photon is then given by

$h\nu = E_i - E_f$

An atom absorbs radiation of the same frequency the atom emits, in which case the electron
is transferred to an orbit with a higher value of $n$.

$E_i + h\nu = E_f$


8. As a result of the quantisation condition of angular momentum, the electron orbits the
nucleus at only specific radii. For a hydrogen atom it is given by

$$r_n = \left(\frac{n^2}{m}\right)\left(\frac{h}{2\pi}\right)^2 \frac{4\pi\varepsilon_0}{e^2}$$

The total energy is also quantised:

$$E_n = -\frac{m e^4}{8 n^2 \varepsilon_0^2 h^2} = -13.6\ \text{eV}/n^2$$


The $n = 1$ state is called the ground state. In the hydrogen atom the ground state energy is -13.6 eV.
Higher values of $n$ correspond to excited states ($n > 1$). Atoms are excited to these higher
states by collisions with other atoms or electrons or by absorption of a photon of the right
frequency.

9. de Broglie’s hypothesis that electrons have a wavelength $\lambda = h/mv$ gave an explanation for
Bohr’s quantised orbits by bringing in the wave-particle duality. The orbits correspond to
circular standing waves in which the circumference of the orbit equals a whole number of
wavelengths.

10. Bohr’s model is applicable only to hydrogenic (single electron) atoms. It cannot be
extended to even two-electron atoms such as helium. This model is also unable to explain
the relative intensities of the frequencies emitted even by hydrogenic atoms.


Points to Ponder 

1. Both Thomson’s and Rutherford’s models constitute unstable systems.
Thomson’s model is unstable electrostatically, while Rutherford’s model is unstable because
of electromagnetic radiation of orbiting electrons.

2. What made Bohr quantise angular momentum (second postulate) and not some other 
quantity? Note, h has dimensions of angular momentum, and for circular orbits, angular 
momentum is a very relevant quantity. The second postulate is then so natural! 






3. The orbital picture in Bohr’s model of the hydrogen atom was inconsistent with the 
uncertainty principle. It was replaced by modern quantum mechanics in which Bohr’s orbits 
are regions where the electron may be found with large probability. 

4. Unlike the situation in the solar system, where planet-planet gravitational forces are very 
small as compared to the gravitational force of the sun on each planet (because the mass of 
the sun is so much greater than the mass of any of the planets), the electron-electron electric 
force interaction is comparable in magnitude to the electron-nucleus electrical force, because 
the charges and distances are of the same order of magnitude. This is the reason why
Bohr’s model with its planet-like electron is not applicable to many-electron atoms.

5. Bohr laid the foundation of the quantum theory by postulating specific orbits in which
electrons do not radiate. Bohr’s model includes only one quantum number n. The new theory
called quantum mechanics supports Bohr’s postulate. However in quantum mechanics
(more generally accepted), a given energy level may not correspond to just one quantum
state. For example, a state is characterised by four quantum numbers (n, l, m, and s), but for
a pure Coulomb potential (as in hydrogen atom) the energy depends only on n.

6. In the Bohr model, contrary to ordinary classical expectation, the frequency of revolution of an
electron in its orbit is not connected to the frequency of the spectral line. The latter is the
difference between two orbital energies divided by h. For transitions between large quantum
numbers (n to n - 1, n very large), however, the two coincide as expected.

7. Bohr’s semiclassical model, based on some aspects of classical physics and some aspects
of modern physics, also does not provide a true picture of the simplest hydrogenic atoms. The
true picture is a quantum mechanical affair which differs from the Bohr model in a number of
fundamental ways. But then if the Bohr model is not strictly correct, why do we bother about
it? The reasons which make Bohr’s model still useful are:

(i) The model is based on just three postulates but accounts for almost all the general features
of the hydrogen spectrum.

(ii) The model incorporates many of the concepts we have learnt in classical physics.




(iii) The model demonstrates how a theoretical physicist occasionally must quite literally 
ignore certain problems of approach in hopes of being able to make some predictions. If the 
predictions of the theory or model agree with experiment, a theoretician then must somehow 
hope to explain away or rationalise the problems that were ignored along the way. 



Exercises 

12.1 Choose the correct alternative from the clues given at the end of
each statement:

(a) The size of the atom in Thomson’s model is .......... the atomic size
in Rutherford’s model. (much greater than/no different from/much less
than.)

(b) In the ground state of .......... electrons are in stable equilibrium,
while in .......... electrons always experience a net force. (Thomson’s
model/ Rutherford’s model.)

(c) A classical atom based on .......... is doomed to collapse.
(Thomson’s model/ Rutherford’s model.)

(d) An atom has a nearly continuous mass distribution in a .......... but
has a highly non-uniform mass distribution in .......... (Thomson’s
model/ Rutherford’s model.)

(e) The positively charged part of the atom possesses most of the
mass in .......... (Rutherford’s model/both the models.)

12.2 Suppose you are given a chance to repeat the alpha-particle 
scattering experiment using a thin sheet of solid hydrogen in place of 
the gold foil. (Hydrogen is a solid at temperatures below 14 K.) What 
results do you expect? 














12.3 What is the shortest wavelength present in the Paschen series 
of spectral lines? 

12.4 A difference of 2.3 eV separates two energy levels in an atom.
What is the frequency of radiation emitted when the atom makes a
transition from the upper level to the lower level?

12.5 The ground state energy of hydrogen atom is -13.6 eV. What 
are the kinetic and potential energies of the electron in this state? 

12.6 A hydrogen atom initially in the ground level absorbs a photon, 
which excites it to the n = 4 level. Determine the wavelength and 
frequency of photon. 

12.7 (a) Using Bohr’s model, calculate the speed of the electron in
a hydrogen atom in the n = 1, 2, and 3 levels. (b) Calculate the orbital
period in each of these levels.

12.8 The radius of the innermost electron orbit of a hydrogen atom is
$5.3 \times 10^{-11}$ m. What are the radii of the n = 2 and n = 3 orbits?

12.9 A 12.5 eV electron beam is used to bombard gaseous hydrogen 
at room temperature. What series of wavelengths will be emitted? 

12.10 In accordance with Bohr’s model, find the quantum number
that characterises the earth’s revolution around the sun in an orbit of
radius $1.5 \times 10^{11}$ m with orbital speed $3 \times 10^4$ m/s. (Mass of earth =
$6.0 \times 10^{24}$ kg.)

Additional Exercises 

12.11 Answer the following questions, which help you understand the 
difference between Thomson’s model and Rutherford’s model better. 

(a) Is the average angle of deflection of α-particles by a thin gold foil
predicted by Thomson’s model much less, about the same, or much
greater than that predicted by Rutherford’s model?

(b) Is the probability of backward scattering (i.e., scattering of
α-particles at angles greater than 90°) predicted by Thomson’s model
much less, about the same, or much greater than that predicted by
Rutherford’s model?

(c) Keeping other factors fixed, it is found experimentally that for
small thickness t, the number of α-particles scattered at moderate
angles is proportional to t. What clue does this linear dependence on
t provide?

(d) In which model is it completely wrong to ignore multiple scattering
for the calculation of average angle of scattering of α-particles by a
thin foil?

12.12 The gravitational attraction between electron and proton in a
hydrogen atom is weaker than the Coulomb attraction by a factor of
about $10^{-40}$. An alternative way of looking at this fact is to estimate
the radius of the first Bohr orbit of a hydrogen atom if the electron and
proton were bound by gravitational attraction. You will find the answer
interesting.

12.13 Obtain an expression for the frequency of radiation emitted 
when a hydrogen atom de-excites from level n to level (n-1). For 
large n, show that this frequency equals the classical frequency of 
revolution of the electron in the orbit. 

12.14 Classically, an electron can be in any orbit around the nucleus
of an atom. Then what determines the typical atomic size? Why is an
atom not, say, a thousand times bigger than its typical size? The
question had greatly puzzled Bohr before he arrived at his famous
model of the atom that you have learnt in the text. To simulate what
he might well have done before his discovery, let us play as follows
with the basic constants of nature and see if we can get a quantity
with the dimensions of length that is roughly equal to the known size
of an atom (~ $10^{-10}$ m).

(a) Construct a quantity with the dimensions of length from the
fundamental constants $e$, $m_e$, and $c$. Determine its numerical value.

(b) You will find that the length obtained in (a) is many orders of
magnitude smaller than the atomic dimensions. Further, it involves c.
But energies of atoms are mostly in the non-relativistic domain where c is
not expected to play any role. This is what may have prompted Bohr
to discard c and look for ‘something else’ to get the right atomic size.
Now, Planck’s constant h had already made its appearance
elsewhere. Bohr’s great insight lay in recognising that $h$, $m_e$, and $e$
will yield the right atomic size. Construct a quantity with the
dimension of length from $h$, $m_e$, and $e$ and confirm that its numerical
value has indeed the correct order of magnitude.

12.15 The total energy of an electron in the first excited state of the 
hydrogen atom is about -3.4 eV. 

(a) What is the kinetic energy of the electron in this state? 

(b) What is the potential energy of the electron in this state? 

(c) Which of the answers above would change if the choice of the 
zero of potential energy is changed? 

12.16 If Bohr’s quantisation postulate (angular momentum = $nh/2\pi$) is
a basic law of nature, it should be equally valid for the case of 
planetary motion also. Why then do we never speak of quantisation 
of orbits of planets around the sun? 




12.17 Obtain the first Bohr radius and the ground state energy of a
muonic hydrogen atom [i.e., an atom in which a negatively charged
muon ($\mu^-$) of mass about 207 $m_e$ orbits around a proton].
Chapter Thirteen


Nuclei 


13.1 Introduction 

In the previous chapter, we have learnt that in every atom, the positive 
charge and mass are densely concentrated at the centre of the atom 
forming its nucleus. The overall dimensions of a nucleus are much 
smaller than those of an atom. Experiments on scattering of α-particles
demonstrated that the radius of a nucleus was smaller than the radius of
an atom by a factor of about $10^4$. This means the volume of a nucleus is
about $10^{-12}$ times the volume of the atom. In other words, an atom is
almost empty. If an atom is enlarged to the size of a classroom, the
nucleus would be of the size of a pinhead. Nevertheless, the nucleus
contains most (more than 99.9%) of the mass of an atom. 

Does the nucleus have a structure, just as the atom does? If so, what are 
the constituents of the nucleus? How are these held together? In this 
chapter, we shall look for answers to such questions. We shall discuss 
various properties of nuclei such as their size, mass and stability, and 
also associated nuclear phenomena such as radioactivity, fission and 
fusion. 

13.2 Atomic Masses and Composition of Nucleus 

The mass of an atom is very small compared to a kilogram; for example,
the mass of a carbon atom, $^{12}\mathrm{C}$, is $1.992647 \times 10^{-26}$ kg. Kilogram is not a
very convenient unit to measure such small quantities. Therefore, a
different mass unit is used for expressing atomic masses. This unit is the
atomic mass unit (u), defined as 1/12th of the mass of the carbon ($^{12}\mathrm{C}$)
atom. According to this definition


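$$1\,\mathrm{u} = \frac{\text{mass of one } {}^{12}\mathrm{C} \text{ atom}}{12} = \frac{1.992647 \times 10^{-26}\ \mathrm{kg}}{12} = 1.660539 \times 10^{-27}\ \mathrm{kg} \qquad (13.1)$$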

The atomic masses of various elements expressed in atomic mass unit 
(u) are close to being integral multiples of the mass of a hydrogen atom. 
There are, however, many striking exceptions to this rule. For example, 
the atomic mass of chlorine atom is 35.46 u. 

Accurate measurement of atomic masses is carried out with a mass 
spectrometer. The measurement of atomic masses reveals the existence
of different types of atoms of the same element, which exhibit the same 
chemical properties, but differ in mass. Such atomic species of the same 
element differing in mass are called isotopes. (In Greek, isotope means 
the same place, i.e. they occur in the same place in the periodic table of 
elements.) It was found that practically every element consists of a 
mixture of several isotopes. The relative abundance of different isotopes 
differs from element to element. Chlorine, for example, has two isotopes 
having masses 34.98 u and 36.98 u, which are nearly integral multiples 
of the mass of a hydrogen atom. The relative abundances of these 
isotopes are 75.4 and 24.6 per cent, respectively. Thus, the average 
mass of a chlorine atom is obtained by the weighted average of the 
masses of the two isotopes, which works out to be 

$$\frac{75.4 \times 34.98 + 24.6 \times 36.98}{100} = 35.47\ \mathrm{u}$$



which agrees with the atomic mass of chlorine. 
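
The weighted average above can be reproduced with a few lines of Python. This is only a minimal sketch: the function name `weighted_average_mass` is an illustrative choice, and the isotope masses and abundances are the ones quoted in the text.

```python
def weighted_average_mass(isotopes):
    """Return the abundance-weighted mean mass in u.

    `isotopes` is a list of (mass_in_u, percent_abundance) pairs.
    """
    total_percent = sum(percent for _, percent in isotopes)
    weighted_sum = sum(mass * percent for mass, percent in isotopes)
    return weighted_sum / total_percent


# Chlorine isotopes as quoted in the text: masses in u, abundances in per cent.
chlorine = [(34.98, 75.4), (36.98, 24.6)]
average_cl = weighted_average_mass(chlorine)
print(f"Average mass of a chlorine atom = {average_cl:.2f} u")
```

Dividing by the summed abundances rather than by a fixed 100 keeps the function usable when the quoted percentages do not add up to exactly 100.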

Even the lightest element, hydrogen, has three isotopes having masses
1.0078 u, 2.0141 u, and 3.0160 u. The nucleus of the lightest atom of
hydrogen, which has a relative abundance of 99.985%, is called the
proton. The mass of a proton is

$$m_p = 1.00727\ \mathrm{u} = 1.67262 \times 10^{-27}\ \mathrm{kg} \qquad (13.2)$$

This is equal to the mass of the hydrogen atom (= 1.00783 u), minus the
mass of a single electron ($m_e$ = 0.00055 u). The other two isotopes of

hydrogen are called deuterium and tritium. Tritium nuclei, being unstable, 
do not occur naturally and are produced artificially in laboratories. 

The positive charge in the nucleus is that of the protons. A proton carries 
one unit of fundamental charge and is stable. It was earlier thought that 
the nucleus may contain electrons, but this was ruled out later using 
arguments based on quantum theory. All the electrons of an atom are 
outside the nucleus. We know that the number of these electrons outside 
the nucleus of the atom is Z, the atomic number. The total charge of the 
atomic electrons is thus (-Ze), and since the atom is neutral, the charge 
of the nucleus is (+Ze). The number of protons in the nucleus of the atom 
is, therefore, exactly Z, the atomic number. 

Discovery of Neutron 

Since the nuclei of deuterium and tritium are isotopes of hydrogen, they 
must contain only one proton each. But the masses of the nuclei of 
hydrogen, deuterium and tritium are in the ratio of 1:2:3. Therefore, the 
nuclei of deuterium and tritium must contain, in addition to a proton, some 
neutral matter. The amount of neutral matter present in the nuclei of 
these isotopes, expressed in units of mass of a proton, is approximately 


equal to one and two, respectively. This fact indicates that the nuclei of 
atoms contain, in addition to protons, neutral matter in multiples of a 
basic unit. This hypothesis was verified in 1932 by James Chadwick who 
observed emission of neutral radiation when beryllium nuclei were 
bombarded with α-particles (α-particles are helium nuclei, to be
discussed in a later section). It was found that this neutral radiation could 
knock out protons from light nuclei such as those of helium, carbon and 
nitrogen. The only neutral radiation known at that time was photons 
(electromagnetic radiation). Application of the principles of conservation 
of energy and momentum showed that if the neutral radiation consisted 
of photons, the energy of photons would have to be much higher than is 
available from the bombardment of beryllium nuclei with α-particles. The
clue to this puzzle, which Chadwick satisfactorily solved, was to assume
that the neutral radiation consists of a new type of neutral particles called
neutrons. From conservation of energy and momentum, he was able to
determine the mass of the new particle ‘as very nearly the same as mass of
proton’.

The mass of a neutron is now known to a high degree of accuracy. It is 
$$m_n = 1.00866\ \mathrm{u} = 1.6749 \times 10^{-27}\ \mathrm{kg} \qquad (13.3)$$

Chadwick was awarded the 1935 Nobel Prize in Physics for his discovery 
of the neutron. 

A free neutron, unlike a free proton, is unstable. It decays into a proton, 
an electron and an antineutrino (another elementary particle), and has a
mean life of about 1000 s. It is, however, stable inside the nucleus.

The composition of a nucleus can now be described using the following 
terms and symbols: 

Z - atomic number = number of protons [13.4(a)] 



N - neutron number = number of neutrons [13.4(b)] 

A - mass number = Z + N = total number of protons and neutrons 
[13.4(c)] 

One also uses the term nucleon for a proton or a neutron. Thus the 
number of nucleons in an atom is its mass number A. 

Nuclear species or nuclides are shown by the notation $^{A}_{Z}\mathrm{X}$ where X is
the chemical symbol of the species. For example, the nucleus of gold is
denoted by $^{197}_{79}\mathrm{Au}$. It contains 197 nucleons, of which 79 are protons
and the remaining 118 are neutrons.

The composition of isotopes of an element can now be readily explained. 
The nuclei of isotopes of a given element contain the same number of 
protons, but differ from each other in their number of neutrons. 
Deuterium, $^{2}_{1}\mathrm{H}$, which is an isotope of hydrogen, contains one proton
and one neutron. Its other isotope tritium, $^{3}_{1}\mathrm{H}$, contains one proton and
two neutrons. The element gold has 32 isotopes, ranging from A = 173 to
A = 204. We have already mentioned that chemical properties of 
elements depend on their electronic structure. As the atoms of isotopes 
have identical electronic structure they have identical chemical behaviour 
and are placed in the same location in the periodic table. 

All nuclides with same mass number A are called isobars. For example,
the nuclides $^{3}_{1}\mathrm{H}$ and $^{3}_{2}\mathrm{He}$ are isobars. Nuclides with same neutron
number N but different atomic number Z, for example $^{198}_{80}\mathrm{Hg}$ and $^{197}_{79}\mathrm{Au}$,
are called isotones.

13.3 Size of the Nucleus 

As we have seen in Chapter 12, Rutherford was the pioneer who 


postulated and established the existence of the atomic nucleus. At 
Rutherford’s suggestion, Geiger and Marsden performed their classic 
experiment on the scattering of α-particles from thin gold foils. Their
experiments revealed that the distance of closest approach to a gold
nucleus of an α-particle of kinetic energy 5.5 MeV is about $4.0 \times 10^{-14}$ m.
The scattering of α-particles by the gold sheet could be understood by
Rutherford by assuming that the Coulomb repulsive force was solely
responsible for scattering. Since the positive charge is confined to the
nucleus, the actual size of the nucleus has to be less than $4.0 \times 10^{-14}$ m.

If we use α-particles of higher energies than 5.5 MeV, the distance of
closest approach to the gold nucleus will be smaller and at some point
the scattering will begin to be affected by the short range nuclear forces,
and differ from Rutherford’s calculations. Rutherford’s calculations are
based on pure Coulomb repulsion between the positive charges of the
α-particle and the gold nucleus. From the distance at which deviations set
in, nuclear sizes can be inferred. 
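
The distance of closest approach quoted above can be estimated from energy conservation. The sketch below assumes a head-on collision in which the whole kinetic energy K of the α-particle is converted into Coulomb potential energy, so that $d = 2Ze^2/(4\pi\varepsilon_0 K)$. The value $e^2/4\pi\varepsilon_0 \approx 1.44$ MeV fm is a standard constant that is not given in the text, and the function name is only illustrative.

```python
# e^2 / (4 * pi * epsilon_0) in MeV fm: a standard value, assumed here,
# not taken from the text.
E2_OVER_4PI_EPS0_MEV_FM = 1.44


def closest_approach_fm(kinetic_energy_mev, z_target, z_projectile=2):
    """Head-on distance of closest approach in femtometres.

    Assumes pure Coulomb repulsion and that all kinetic energy is
    converted into electrostatic potential energy at the turning point.
    """
    return z_projectile * z_target * E2_OVER_4PI_EPS0_MEV_FM / kinetic_energy_mev


# 5.5 MeV alpha-particle (Z = 2) incident on gold (Z = 79).
d_gold_fm = closest_approach_fm(5.5, 79)
print(f"Closest approach = {d_gold_fm:.1f} fm = {d_gold_fm * 1e-15:.2e} m")
```

Under these assumptions the result is close to the figure of about $4.0 \times 10^{-14}$ m quoted in the text, and it shrinks in inverse proportion to the kinetic energy.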

By performing scattering experiments in which fast electrons, instead of 
α-particles, are projectiles that bombard targets made up of various
elements, the sizes of nuclei of various elements have been accurately 
measured. 

It has been found that a nucleus of mass number A has a radius

$$R = R_0 A^{1/3} \qquad (13.5)$$

where $R_0 = 1.2 \times 10^{-15}$ m (= 1.2 fm; 1 fm = $10^{-15}$ m). This means the
volume of the nucleus, which is proportional to $R^3$, is proportional to A.
Thus the density of nucleus is a constant, independent of A, for all nuclei.
Different nuclei are like a drop of liquid of constant density. The density of
nuclear matter is approximately $2.3 \times 10^{17}$ kg m$^{-3}$. This density is very
large compared to ordinary matter, say water, which is $10^3$ kg m$^{-3}$. This is
understandable, as we have already seen that most of the atom is empty. 
Ordinary matter consisting of atoms has a large amount of empty space. 


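Example 13.1 Given the mass of iron nucleus as 55.85 u and A = 56, find the nuclear density.

Solution

$m_{\mathrm{Fe}} = 55.85\ \mathrm{u} = 9.27 \times 10^{-26}$ kg

$$\text{Nuclear density} = \frac{\text{mass}}{\text{volume}} = \frac{9.27 \times 10^{-26}}{(4\pi/3)(1.2 \times 10^{-15})^3} \times \frac{1}{56} = 2.29 \times 10^{17}\ \mathrm{kg\,m^{-3}}$$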


The density of matter in neutron stars (an astrophysical object) is comparable to this density. 
This shows that matter in these objects has been compressed to such an extent that they 
resemble a big nucleus. 
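
The claim that nuclear density does not depend on A follows directly from Eq. (13.5), and can be checked numerically. The sketch below uses $R_0$ from Eq. (13.5) and the value of u from Eq. (13.1); approximating the nuclear mass by A u when no measured mass is supplied is an assumption made here for illustration.

```python
import math

R0_M = 1.2e-15        # m, from Eq. (13.5)
U_KG = 1.660539e-27   # kg, from Eq. (13.1)


def nuclear_radius_m(mass_number):
    """Nuclear radius R = R0 * A^(1/3) in metres."""
    return R0_M * mass_number ** (1.0 / 3.0)


def nuclear_density(mass_number, mass_u=None):
    """Density in kg/m^3; the mass defaults to A u (an approximation)."""
    if mass_u is None:
        mass_u = float(mass_number)
    volume = (4.0 / 3.0) * math.pi * nuclear_radius_m(mass_number) ** 3
    return mass_u * U_KG / volume


# Iron, with the mass used in Example 13.1.
rho_fe = nuclear_density(56, 55.85)
print(f"Fe-56: {rho_fe:.2e} kg/m^3")

# With mass taken as A u, the density is the same for every A.
for a in (16, 56, 197, 238):
    print(a, f"{nuclear_density(a):.3e} kg/m^3")
```

Because the volume grows as A while the mass also grows (very nearly) as A, the ratio is fixed by $R_0$ and u alone.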


13.4 Mass-Energy and Nuclear Binding Energy 

13.4.1 Mass - Energy 

Einstein showed from his theory of special relativity that it is necessary to 
treat mass as another form of energy. Before the advent of this theory of 
special relativity it was presumed that mass and energy were conserved 
separately in a reaction. However, Einstein showed that mass is another 
form of energy and one can convert mass-energy into other forms of 
energy, say kinetic energy and vice-versa. 






Einstein gave the famous mass-energy equivalence relation

$$E = mc^2 \qquad (13.6)$$

Here E is the energy equivalent of mass m, and c is the velocity of light in
vacuum, approximately equal to $3 \times 10^8$ m s$^{-1}$.


Example 13.2 Calculate the energy equivalent of 1 g of substance. 

Solution 

Energy, $E = 10^{-3} \times (3 \times 10^8)^2$ J
$E = 10^{-3} \times 9 \times 10^{16} = 9 \times 10^{13}$ J

Thus, if one gram of matter is converted to energy, there is a release of enormous amount of 
energy. 
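
Eq. (13.6) is simple enough to evaluate directly. The following sketch applies it to 1 g of matter and to one atomic mass unit (taken as $1.6605 \times 10^{-27}$ kg); the helper names, and the choice of $1.602 \times 10^{-19}$ J per eV for the conversion, are illustrative assumptions.

```python
def rest_energy_joule(mass_kg, c=2.9979e8):
    """Energy equivalent E = m c^2 in joules (c in m/s)."""
    return mass_kg * c ** 2


def joule_to_mev(energy_j, joule_per_ev=1.602e-19):
    """Convert joules to MeV using the electron charge in coulombs."""
    return energy_j / joule_per_ev / 1.0e6


# 1 g of matter, using the rounded value c = 3 x 10^8 m/s.
energy_1g = rest_energy_joule(1.0e-3, c=3.0e8)

# One atomic mass unit expressed in MeV.
energy_1u_mev = joule_to_mev(rest_energy_joule(1.6605e-27))

print(f"1 g -> {energy_1g:.1e} J")
print(f"1 u -> {energy_1u_mev:.1f} MeV")
```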


Experimental verification of Einstein’s mass-energy relation has been
achieved in the study of nuclear reactions amongst nucleons, nuclei, 
electrons and other more recently discovered particles. In a reaction the 
conservation law of energy states that the initial energy and the final 
energy are equal provided the energy associated with mass is also 
included. This concept is important in understanding nuclear masses and 
the interaction of nuclei with one another. They form the subject matter of 
the next few sections. 

13.4.2 Nuclear binding energy 

In Section 13.2 we have seen that the nucleus is made up of neutrons 




and protons. Therefore it may be expected that the mass of the nucleus 
is equal to the total mass of its individual protons and neutrons. However, 
the nuclear mass M is found to be always less than this. For example, let 
us consider $^{16}_{8}\mathrm{O}$, a nucleus which has 8 neutrons and 8 protons. We
have

Mass of 8 neutrons = 8 × 1.00866 u
Mass of 8 protons = 8 × 1.00727 u
Mass of 8 electrons = 8 × 0.00055 u
Therefore the expected mass of $^{16}_{8}\mathrm{O}$ nucleus
= 8 × 2.01593 u = 16.12744 u.

The atomic mass of $^{16}_{8}\mathrm{O}$ found from mass spectroscopy experiments is
seen to be 15.99493 u. Subtracting the mass of 8 electrons (8 ×
0.00055 u) from this, we get the experimental mass of $^{16}_{8}\mathrm{O}$ nucleus to be
15.99053 u.

Thus, we find that the mass of the $^{16}_{8}\mathrm{O}$ nucleus is less than the total
mass of its constituents by 0.13691 u. The difference in mass of a nucleus
and its constituents, $\Delta M$, is called the mass defect, and is given by

$$\Delta M = [Z m_p + (A - Z) m_n] - M \qquad (13.7)$$

What is the meaning of the mass defect? It is here that Einstein’s 
equivalence of mass and energy plays a role. Since the mass of the 
oxygen nucleus is less than the sum of the masses of its constituents (8
protons and 8 neutrons, in the unbound state), the equivalent energy of
the oxygen nucleus is less than that of the sum of the equivalent energies
of its constituents. If one wants to break the oxygen nucleus into 8
protons and 8 neutrons, this extra energy $\Delta M c^2$ has to be supplied. This
energy required, $E_b$, is related to the mass defect by

$$E_b = \Delta M c^2 \qquad (13.8)$$


Example 13.3 Find the energy equivalent of one atomic mass unit, first in joules and then in
MeV. Using this, express the mass defect of $^{16}_{8}\mathrm{O}$ in MeV/$c^2$.

Solution

1 u = $1.6605 \times 10^{-27}$ kg

To convert it into energy units, we multiply it by $c^2$ and find that
energy equivalent = $1.6605 \times 10^{-27} \times (2.9979 \times 10^8)^2$ kg m$^2$/s$^2$
= $1.4924 \times 10^{-10}$ J

$$= \frac{1.4924 \times 10^{-10}}{1.602 \times 10^{-19}}\ \mathrm{eV}$$

= $0.9315 \times 10^9$ eV
= 931.5 MeV

or, 1 u = 931.5 MeV/$c^2$

For $^{16}_{8}\mathrm{O}$, $\Delta M$ = 0.13691 u = 0.13691 × 931.5 MeV/$c^2$
= 127.5 MeV/$c^2$

The energy needed to separate $^{16}_{8}\mathrm{O}$ into its constituents is thus 127.5 MeV.




If a certain number of neutrons and protons are brought together to form 
a nucleus of a certain charge and mass, an energy $E_b$ will be released in
the process. The energy $E_b$ is called the binding energy of the nucleus. If
we separate a nucleus into its nucleons, we would have to supply a total
energy equal to $E_b$ to those particles. Although we cannot tear apart a
nucleus in this way, the nuclear binding energy is still a convenient
measure of how well a nucleus is held together. A more useful measure
of the binding between the constituents of the nucleus is the binding
energy per nucleon, $E_{bn}$, which is the ratio of the binding energy $E_b$ of a
nucleus to the number of the nucleons, A, in that nucleus:

$$E_{bn} = E_b / A \qquad (13.9)$$



Figure 13.1 The binding energy per nucleon 
as a function of mass number. 


We can think of binding energy per nucleon as the average energy per 
nucleon needed to separate a nucleus into its individual nucleons. 
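
The chain from mass defect to binding energy per nucleon, Eqs. (13.7) to (13.9), can be sketched in Python for the oxygen nucleus discussed above. The particle masses and the conversion 1 u = 931.5 MeV/$c^2$ are the values used in the text; the function names are illustrative.

```python
M_PROTON_U = 1.00727
M_NEUTRON_U = 1.00866
M_ELECTRON_U = 0.00055
U_TO_MEV = 931.5   # energy equivalent of 1 u in MeV


def mass_defect_u(z, a, atomic_mass_u):
    """Mass defect (Eq. 13.7) in u, starting from the atomic mass."""
    nuclear_mass = atomic_mass_u - z * M_ELECTRON_U   # strip the Z electrons
    return z * M_PROTON_U + (a - z) * M_NEUTRON_U - nuclear_mass


def binding_energy_mev(z, a, atomic_mass_u):
    """Binding energy E_b = (mass defect) * c^2 (Eq. 13.8) in MeV."""
    return mass_defect_u(z, a, atomic_mass_u) * U_TO_MEV


# Oxygen-16: Z = 8, A = 16, atomic mass 15.99493 u.
delta_m = mass_defect_u(8, 16, 15.99493)
e_b = binding_energy_mev(8, 16, 15.99493)
e_bn = e_b / 16   # binding energy per nucleon, Eq. (13.9)

print(f"mass defect = {delta_m:.5f} u")
print(f"E_b = {e_b:.1f} MeV, E_bn = {e_bn:.2f} MeV per nucleon")
```

The electron binding energies, which are of the order of eV to keV, are neglected here, as they are in the text.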





















Figure 13.1 is a plot of the binding energy per nucleon $E_{bn}$ versus the
mass number A for a large number of nuclei. We notice the following
main features of the plot:

(i) the binding energy per nucleon, $E_{bn}$, is practically constant, i.e.
practically independent of the atomic number for nuclei of middle mass
number (30 < A < 170). The curve has a maximum of about 8.75 MeV for
A = 56 and has a value of 7.6 MeV for A = 238.

(ii) $E_{bn}$ is lower for both light nuclei (A < 30) and heavy nuclei (A > 170).

We can draw some conclusions from these two observations: 

(i) The force is attractive and sufficiently strong to produce a binding 
energy of a few MeV per nucleon. 

(ii) The constancy of the binding energy in the range 30 < A < 170 is a 
consequence of the fact that the nuclear force is short-ranged. Consider 
a particular nucleon inside a sufficiently large nucleus. It will be under the 
influence of only some of its neighbours, which come within the range of 
the nuclear force. If any other nucleon is at a distance more than the 
range of the nuclear force from the particular nucleon it will have no 
influence on the binding energy of the nucleon under consideration. If a 
nucleon can have a maximum of p neighbours within the range of nuclear 
force, its binding energy would be proportional to p. Let the binding 
energy of the nucleon be pk, where k is a constant having the dimensions
of energy. If we increase A by adding nucleons they will not change the 
binding energy of a nucleon inside. Since most of the nucleons in a large 
nucleus reside inside it and not on the surface, the change in binding 
energy per nucleon would be small. The binding energy per nucleon is a 
constant and is approximately equal to pk. The property that a given 



nucleon influences only nucleons close to it is also referred to as 
saturation property of the nuclear force. 

(iii) A very heavy nucleus, say A = 240, has lower binding energy per 
nucleon compared to that of a nucleus with A = 120. Thus if a nucleus A 
= 240 breaks into two A = 120 nuclei, nucleons get more tightly bound. 
This implies energy would be released in the process. It has very 
important implications for energy production through fission, to be 
discussed later in Section 13.7.1. 

(iv) Consider two very light nuclei (A<10) joining to form a heavier 
nucleus. The binding energy per nucleon of the fused heavier nuclei is 
more than the binding energy per nucleon of the lighter nuclei. This 
means that the final system is more tightly bound than the initial system. 
Again energy would be released in such a process of fusion. This is the 
energy source of the sun, to be discussed later in Section 13.7.3.

13.5 Nuclear Force 

The force that determines the motion of atomic electrons is the familiar 
Coulomb force. In Section 13.4, we have seen that for average mass 
nuclei the binding energy per nucleon is approximately 8 MeV, which is 
much larger than the binding energy in atoms. Therefore, to bind a 
nucleus together there must be a strong attractive force of a totally 
different kind. It must be strong enough to overcome the repulsion 
between the (positively charged) protons and to bind both protons and 
neutrons into the tiny nuclear volume. We have already seen that the 
constancy of binding energy per nucleon can be understood in terms of 
its short-range. Many features of the nuclear binding force are 
summarised below. These are obtained from a variety of experiments 
carried out during 1930 to 1950. 

Figure 13.2 Potential energy of a pair of nucleons as a function of their separation. For a
separation greater than $r_0$, the force is attractive and for separations less than $r_0$, the force is
strongly repulsive.

(i) The nuclear force is much stronger than the Coulomb force acting
between charges or the gravitational forces between masses. The 
nuclear binding force has to dominate over the Coulomb repulsive force 
between protons inside the nucleus. This happens only because the 
nuclear force is much stronger than the Coulomb force. The gravitational
force is much weaker than even the Coulomb force.

(ii) The nuclear force between two nucleons falls rapidly to zero as their
separation increases beyond a few femtometres. This leads to saturation of
forces in a medium or a large-sized nucleus, which is the reason for the 
constancy of the binding energy per nucleon. 

A rough plot of the potential energy between two nucleons as a function 
of distance is shown in Fig. 13.2. The potential energy is a minimum
at a distance $r_0$ of about 0.8 fm. This means that the force is attractive for
distances larger than 0.8 fm and repulsive if they are separated by
distances less than 0.8 fm.

(iii) The nuclear force between neutron-neutron, proton-neutron and 
proton-proton is approximately the same. The nuclear force does not 
depend on the electric charge. 

Unlike Coulomb’s law or Newton’s law of gravitation there is no
simple mathematical form of the nuclear force. 

13.6 Radioactivity 

A. H. Becquerel discovered radioactivity in 1896 purely by accident. 
While studying the fluorescence and phosphorescence of compounds 
irradiated with visible light, Becquerel observed an interesting 
phenomenon. After illuminating some pieces of uranium-potassium 
sulphate with visible light, he wrapped them in black paper and separated 
the package from a photographic plate by a piece of silver. When, after 
several hours of exposure, the photographic plate was developed, it 
showed blackening due to something that must have been emitted by the 
compound and was able to penetrate both black paper and the silver. 

Experiments performed subsequently showed that radioactivity was a 
nuclear phenomenon in which an unstable nucleus undergoes a decay. 
This is referred to as radioactive decay. Three types of radioactive decay 
occur in nature : 

(i) α-decay in which a helium nucleus $^{4}_{2}\mathrm{He}$ is emitted;

(ii) β-decay in which electrons or positrons (particles with the same mass
as electrons, but with a charge exactly opposite to that of electron) are
emitted;

(iii) γ-decay in which high energy (hundreds of keV or more) photons are
emitted.

Each of these decays will be considered in subsequent sub-sections.


13.6.1 Law of radioactive decay 

In any radioactive sample, which undergoes α, β or γ-decay, it is found
that the number of nuclei undergoing the decay per unit time is
proportional to the total number of nuclei in the sample. If N is the
number of nuclei in the sample and $\Delta N$ undergo decay in time $\Delta t$ then

$$\frac{\Delta N}{\Delta t} \propto N$$

or, $\Delta N / \Delta t = \lambda N$, (13.10)

where $\lambda$ is called the radioactive decay constant or disintegration
constant.

The change in the number of nuclei in the sample* is $dN = -\Delta N$ in time
$\Delta t$. Thus the rate of change of N is (in the limit $\Delta t \to 0$)

$$\frac{dN}{dt} = -\lambda N \quad \text{or,} \quad \frac{dN}{N} = -\lambda\, dt$$

* $\Delta N$ is the number of nuclei that decay, and hence is always positive. dN is the change in N,
which may have either sign. Here it is negative, because out of original N nuclei, $\Delta N$ have
decayed, leaving (N − $\Delta N$) nuclei.




Now, integrating both sides of the above equation, we get,

$$\int_{N_0}^{N} \frac{dN}{N} = -\lambda \int_{t_0}^{t} dt \qquad (13.11)$$

or, $\ln N - \ln N_0 = -\lambda (t - t_0)$ (13.12)



Here $N_0$ is the number of radioactive nuclei in the sample at some
arbitrary time $t_0$ and N is the number of radioactive nuclei at any
subsequent time t. Setting $t_0 = 0$ and rearranging Eq. (13.12) gives us

$$\ln \frac{N}{N_0} = -\lambda t \qquad (13.13)$$

which gives

$$N(t) = N_0 e^{-\lambda t} \qquad (13.14)$$

Note, for example, that light bulbs follow no such exponential decay law. If
we test 1000 bulbs for their life (time span before they burn out or fuse), 
we expect that they will ‘decay’ (that is, burn out) at more or less the 
same time. The decay of radionuclides follows quite a different law, the 
law of radioactive decay represented by Eq. (13.14). 
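
The step from the proportionality in Eq. (13.10) to the exponential law of Eq. (13.14) can be illustrated numerically: repeatedly removing a fraction $\lambda\,\Delta t$ of the surviving nuclei in small time steps approaches $N_0 e^{-\lambda t}$. The sketch below is only an illustration; the decay constant, time span and step count are arbitrary choices, not values from the text.

```python
import math


def decay_by_steps(n0, decay_constant, t_total, steps):
    """Apply dN = -lambda * N * dt repeatedly over `steps` small intervals."""
    dt = t_total / steps
    n = float(n0)
    for _ in range(steps):
        n -= decay_constant * n * dt
    return n


def decay_exact(n0, decay_constant, t):
    """Closed-form law of radioactive decay, Eq. (13.14)."""
    return n0 * math.exp(-decay_constant * t)


# Arbitrary illustrative numbers: lambda = 0.1 per second, t = 10 s.
n_steps = decay_by_steps(1.0e6, 0.1, 10.0, 100000)
n_exact = decay_exact(1.0e6, 0.1, 10.0)
print(f"stepwise: {n_steps:.1f}, exact: {n_exact:.1f}")
```

The agreement improves as the step size shrinks, which mirrors the limit $\Delta t \to 0$ taken in the derivation.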

The total decay rate R of a sample is the number of nuclei disintegrating
per unit time. Suppose in a time interval dt, the decay count measured is
$\Delta N$. Then $dN = -\Delta N$.

The positive quantity R is then defined as

$$R = -\frac{dN}{dt}$$

Figure 13.3 Exponential decay of a radioactive species. After a lapse of $T_{1/2}$, population of the
given species drops by a factor of 2.

Differentiating Eq. (13.14), we get

$$R = \lambda N_0 e^{-\lambda t}$$

or, $R = R_0 e^{-\lambda t}$ (13.15)

This is equivalent to the law of radioactive decay, since you can
integrate Eq. (13.15) to get back Eq. (13.14). Clearly, $R_0 = \lambda N_0$ is the
decay rate at t = 0. The decay rate R at a certain time t and the number
of undecayed nuclei N at the same time are related by

$$R = \lambda N \qquad (13.16)$$

The decay rate of a sample, rather than the number of radioactive nuclei,
is a more direct experimentally measurable quantity and is given a
specific name: activity. The SI unit for activity is becquerel, named after
the discoverer of radioactivity, Henri Becquerel.

1 becquerel is simply equal to 1 disintegration or decay per second.
There is also another unit named “curie” that is widely used and is related
to the SI unit as:

1 curie = 1 Ci = $3.7 \times 10^{10}$ decays per second
= $3.7 \times 10^{10}$ Bq

Marie Sklodowska Curie (1867-1934)

Born in Poland. She is recognised both as a physicist and as a chemist.
The discovery of radioactivity by Henri Becquerel in 1896 inspired Marie
and her husband Pierre Curie in their researches and analyses which led to
the isolation of radium and polonium elements. She was the first
person to be awarded two Nobel Prizes, for Physics in 1903 and for
Chemistry in 1911.

Different radionuclides differ greatly in their rate of decay. A common way 
to characterize this feature is through the notion of half-life. Half-life of a 
radionuclide (denoted by $T_{1/2}$) is the time it takes for a sample that has
initially, say, $N_0$ radionuclei to reduce to $N_0/2$. Putting
$N = N_0/2$ and $t = T_{1/2}$ in Eq. (13.14), we get

$$T_{1/2} = \frac{\ln 2}{\lambda} = \frac{0.693}{\lambda} \qquad (13.17)$$

Clearly if $N_0$ reduces to half its value in time $T_{1/2}$, $R_0$ will also reduce to
half its value in the same time according to Eq. (13.16).






Another related measure is the average or mean life $\tau$. This again can be
obtained from Eq. (13.14). The number of nuclei which decay in the time
interval t to t + $\Delta t$ is $R(t)\Delta t$ (= $\lambda N_0 e^{-\lambda t} \Delta t$). Each of them has lived for time t.
Thus the total life of all these nuclei would be $t \lambda N_0 e^{-\lambda t} \Delta t$. It is clear that
some nuclei may live for a short time while others may live longer.
Therefore to obtain the mean life, we have to sum (or integrate) this
expression over all times from 0 to $\infty$, and divide by the total number $N_0$
of nuclei at t = 0. Thus,

$$\tau = \frac{\lambda N_0 \int_0^\infty t e^{-\lambda t}\, dt}{N_0} = \lambda \int_0^\infty t e^{-\lambda t}\, dt$$

One can show by performing this integral that
$\tau = 1/\lambda$

We summarise these results with the following:

$$T_{1/2} = \frac{\ln 2}{\lambda} = \tau \ln 2 \qquad (13.18)$$

Radioactive elements (e.g., tritium, plutonium) which are short-lived i.e., 
have half-lives much less than the age of the universe (~15 billion years) 
have obviously decayed long ago and are not found in nature. They can, 
however, be produced artificially in nuclear reactions. 
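
The relations between decay constant, half-life and mean life in Eqs. (13.17) and (13.18) can be checked numerically. The sketch below evaluates the mean-life integral $\lambda \int_0^\infty t e^{-\lambda t}\,dt$ with a simple midpoint rule; truncating the integral at 40 mean lives, the step count, and the value of the decay constant are illustrative assumptions.

```python
import math


def half_life(decay_constant):
    """Half-life T_1/2 = ln 2 / lambda, Eq. (13.17)."""
    return math.log(2) / decay_constant


def mean_life_numeric(decay_constant, steps=200000, t_max_in_mean_lives=40.0):
    """Midpoint-rule estimate of lambda * integral of t * exp(-lambda t) dt.

    The upper limit is cut off at a large multiple of 1/lambda, where the
    integrand is negligibly small.
    """
    t_max = t_max_in_mean_lives / decay_constant
    dt = t_max / steps
    total = 0.0
    for i in range(steps):
        t = (i + 0.5) * dt
        total += t * math.exp(-decay_constant * t) * dt
    return decay_constant * total


lam = 0.05                      # per second, arbitrary illustrative value
tau = mean_life_numeric(lam)    # should be close to 1 / lam
print(f"mean life = {tau:.4f} s, 1/lambda = {1 / lam:.4f} s")
print(f"half-life = {half_life(lam):.4f} s, tau * ln 2 = {tau * math.log(2):.4f} s")
```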

Example 13.4 The half-life of $^{238}_{92}\mathrm{U}$ undergoing α-decay is $4.5 \times 10^9$ years. What is the
activity of 1 g sample of $^{238}_{92}\mathrm{U}$?

Solution

$T_{1/2} = 4.5 \times 10^9$ y
= $4.5 \times 10^9$ y × $3.16 \times 10^7$ s/y
= $1.42 \times 10^{17}$ s

One kmol of any isotope contains Avogadro’s number of atoms, and so 1 g of $^{238}_{92}\mathrm{U}$
contains

$$\frac{10^{-3}}{238}\ \mathrm{kmol} \times 6.025 \times 10^{26}\ \mathrm{atoms/kmol} = 25.3 \times 10^{20}\ \text{atoms.}$$

The decay rate R is
$R = \lambda N$

$$= \frac{0.693}{T_{1/2}}\, N = \frac{0.693 \times 25.3 \times 10^{20}}{1.42 \times 10^{17}}\ \mathrm{s^{-1}}$$

= $1.23 \times 10^4$ s$^{-1}$
= $1.23 \times 10^4$ Bq

Example 13.5 Tritium has a half-life of 12.5 y undergoing beta decay. What fraction of a
sample of pure tritium will remain undecayed after 25 y?

Solution

By definition of half-life, half of the initial sample will remain undecayed after 12.5 y. In the
next 12.5 y, one-half of these nuclei would have decayed. Hence, one fourth of the sample of
the initial pure tritium will remain undecayed.
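
The arithmetic of Examples 13.4 and 13.5 can be reproduced with a short Python sketch. The constants are the ones used in the examples; the function names are illustrative, and the code uses ln 2 rather than the rounded 0.693, so the last digit may differ slightly from the worked solution.

```python
import math

AVOGADRO_PER_KMOL = 6.025e26   # atoms per kmol, value used in Example 13.4
SECONDS_PER_YEAR = 3.16e7      # value used in Example 13.4


def activity_bq(mass_kg, molar_mass_kg_per_kmol, half_life_s):
    """Activity R = lambda * N in becquerel for a pure sample."""
    n_atoms = mass_kg / molar_mass_kg_per_kmol * AVOGADRO_PER_KMOL
    decay_constant = math.log(2) / half_life_s
    return decay_constant * n_atoms


def fraction_remaining(elapsed, half_life):
    """Fraction of nuclei left after `elapsed` time (same unit as half_life)."""
    return 0.5 ** (elapsed / half_life)


# Example 13.4: 1 g of uranium-238 with a half-life of 4.5e9 years.
r_u238 = activity_bq(1.0e-3, 238.0, 4.5e9 * SECONDS_PER_YEAR)
print(f"Activity of 1 g of U-238 = {r_u238:.3g} Bq")

# Example 13.5: tritium, half-life 12.5 y, after 25 y.
tritium_left = fraction_remaining(25.0, 12.5)
print(f"Fraction of tritium remaining = {tritium_left}")
```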




13.6.2 Alpha decay 

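A well-known example of alpha decay is the decay of uranium $^{238}_{92}\mathrm{U}$ to
thorium $^{234}_{90}\mathrm{Th}$ with the emission of a helium nucleus $^{4}_{2}\mathrm{He}$

$$^{238}_{92}\mathrm{U} \to {}^{234}_{90}\mathrm{Th} + {}^{4}_{2}\mathrm{He} \quad (\alpha\text{-decay}) \qquad (13.19)$$

In α-decay, the mass number of the product nucleus (daughter nucleus)
is four less than that of the decaying nucleus (parent nucleus), while the
atomic number decreases by two. In general, α-decay of a parent
nucleus $^{A}_{Z}\mathrm{X}$ results in a daughter nucleus $^{A-4}_{Z-2}\mathrm{Y}$

$$^{A}_{Z}\mathrm{X} \to {}^{A-4}_{Z-2}\mathrm{Y} + {}^{4}_{2}\mathrm{He} \qquad (13.20)$$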

From Einstein’s mass-energy equivalence relation [Eq. (13.6)] and
energy conservation, it is clear that this spontaneous decay is possible
only when the total mass of the decay products is less than the mass of
the initial nucleus. This difference in mass appears as kinetic energy of
the products. By referring to a table of nuclear masses, one can check
that the total mass of $^{234}_{90}\mathrm{Th}$ and $^{4}_{2}\mathrm{He}$ is indeed less than that of $^{238}_{92}\mathrm{U}$.

The disintegration energy or the Q-value of a nuclear reaction is the
difference between the initial mass energy and the total mass energy of
the decay products. For α-decay

$$Q = (m_X - m_Y - m_{He})\, c^2 \qquad (13.21)$$

Q is also the net kinetic energy gained in the process or, if the initial
nucleus X is at rest, the kinetic energy of the products. Clearly, Q > 0 for
exothermic processes such as α-decay.


Example 13.6 We are given the following atomic masses: 




$^{238}_{92}\mathrm{U}$ = 238.05079 u    $^{4}_{2}\mathrm{He}$ = 4.00260 u
$^{234}_{90}\mathrm{Th}$ = 234.04363 u    $^{1}_{1}\mathrm{H}$ = 1.00783 u
$^{237}_{91}\mathrm{Pa}$ = 237.05121 u

Here the symbol Pa is for the element protactinium (Z = 91).

(a) Calculate the energy released during the alpha decay of $^{238}_{92}\mathrm{U}$.

(b) Show that $^{238}_{92}\mathrm{U}$ cannot spontaneously emit a proton.

Solution 

(a) The alpha decay of $^{238}_{92}\mathrm{U}$ is given by Eq. (13.20). The energy released in this process is
given by

$Q = (M_U - M_{Th} - M_{He})\, c^2$

Substituting the atomic masses as given in the data, we find
Q = (238.05079 − 234.04363 − 4.00260) u × $c^2$
= (0.00456 u) $c^2$
= (0.00456 u) (931.5 MeV/u)
= 4.25 MeV.

(b) If $^{238}_{92}\mathrm{U}$ spontaneously emits a proton, the decay process would be

$$^{238}_{92}\mathrm{U} \to {}^{237}_{91}\mathrm{Pa} + {}^{1}_{1}\mathrm{H}$$

The Q for this process to happen is
$Q = (M_U - M_{Pa} - M_H)\, c^2$
= (238.05079 − 237.05121 − 1.00783) u × $c^2$
= (−0.00825 u) $c^2$
= −(0.00825 u)(931.5 MeV/u)
= −7.68 MeV

Thus, the Q of the process is negative and therefore it cannot proceed spontaneously. We will
have to supply an energy of 7.68 MeV to a $^{238}_{92}\mathrm{U}$ nucleus to make it emit a proton.
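
The two Q-value calculations of Example 13.6 follow the same pattern, which a small helper function makes explicit. This is a sketch under the example's own data; the function name `q_value_mev` is illustrative, and the conversion 931.5 MeV per u is the value used in the text.

```python
U_TO_MEV = 931.5   # energy equivalent of 1 u in MeV


def q_value_mev(parent_mass_u, product_masses_u):
    """Q = (initial mass - total final mass) * c^2, in MeV."""
    return (parent_mass_u - sum(product_masses_u)) * U_TO_MEV


# (a) U-238 -> Th-234 + He-4
q_alpha = q_value_mev(238.05079, [234.04363, 4.00260])

# (b) U-238 -> Pa-237 + H-1 (hypothetical proton emission)
q_proton = q_value_mev(238.05079, [237.05121, 1.00783])

for label, q in (("alpha decay", q_alpha), ("proton emission", q_proton)):
    verdict = "can proceed spontaneously" if q > 0 else "cannot proceed spontaneously"
    print(f"{label}: Q = {q:.2f} MeV, {verdict}")
```

Using atomic masses on both sides works here because the electron masses cancel between parent and products.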


13.6.3 Beta decay 

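In beta decay, a nucleus spontaneously emits an electron ($\beta^-$ decay) or a
positron ($\beta^+$ decay). A common example of $\beta^-$ decay is

$$^{32}_{15}\mathrm{P} \to {}^{32}_{16}\mathrm{S} + e^- + \bar{\nu} \qquad (13.22)$$

and that of $\beta^+$ decay is

$$^{22}_{11}\mathrm{Na} \to {}^{22}_{10}\mathrm{Ne} + e^+ + \nu \qquad (13.23)$$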

The decays are governed by Eqs. (13.14) and (13.15), so that one
can never predict which nucleus will undergo decay, but one can
characterize the decay by a half-life $T_{1/2}$. For example, $T_{1/2}$ for the
decays above is respectively 14.3 d and 2.6 y. The emission of an electron in
$\beta^-$ decay is accompanied by the emission of an antineutrino ($\bar{\nu}$); in $\beta^+$
decay, instead, a neutrino ($\nu$) is generated. Neutrinos are neutral particles
with very small (possibly, even zero) mass compared to electrons. They
have only weak interaction with other particles. They are, therefore, very
difficult to detect, since they can penetrate a large quantity of matter (even
the earth) without any interaction.

In both $\beta^-$ and $\beta^+$ decay, the mass number A remains unchanged. In $\beta^-$
decay, the atomic number Z of the nucleus goes up by 1, while in $\beta^+$
decay Z goes down by 1. The basic nuclear process underlying $\beta^-$ decay
is the conversion of a neutron to a proton

$n \rightarrow p + e^- + \bar{\nu}$ (13.24)

while for $\beta^+$ decay, it is the conversion of a proton into a neutron

$p \rightarrow n + e^+ + \nu$ (13.25)

Note that while a free neutron decays to a proton, the decay of a proton to
a neutron [Eq. (13.25)] is possible only inside the nucleus, since the proton has
a smaller mass than the neutron.
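
The mass argument can be made quantitative with the values listed in the data for the exercises at the end of the chapter. The sketch below rests on stated assumptions: the neutrino mass is neglected, and the proton mass is approximated as the hydrogen-atom mass minus one electron mass (the 13.6 eV atomic binding energy is ignored). The variable names are illustrative only.

```python
U_TO_MEV = 931.5   # energy equivalent of 1 u, in MeV

M_N = 1.008665     # neutron mass in u
M_H = 1.007825     # mass of the hydrogen ATOM (proton + electron) in u
M_E = 0.000548     # electron (or positron) mass in u

# Assumption: proton mass = hydrogen-atom mass minus one electron mass.
m_p = M_H - M_E

# n -> p + e- + antineutrino (neutrino mass neglected)
q_neutron_decay = (M_N - m_p - M_E) * U_TO_MEV

# p -> n + e+ + neutrino for a FREE proton (neutrino mass neglected)
q_proton_decay = (m_p - M_N - M_E) * U_TO_MEV

print(f"free neutron decay: Q = {q_neutron_decay:.3f} MeV")
print(f"free proton decay:  Q = {q_proton_decay:.3f} MeV")
```

The first Q comes out positive (a little under 1 MeV), so a free neutron can decay. The second is negative, so a free proton cannot decay in this way; inside a nucleus the energy deficit can be made up by the change in nuclear binding energy.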

13.6.4 Gamma decay 

Like an atom, a nucleus also has discrete energy levels - the ground 
state and excited states. The scale of energy is, however, very different. 
Atomic energy level spacings are of the order of eV, while the difference 
in nuclear energy levels is of the order of MeV. When a nucleus in an 
excited state spontaneously decays to its ground state (or to a lower 
energy state), a photon is emitted with energy equal to the difference in 
the two energy levels of the nucleus. This is the so-called gamma decay. 
The energy (MeV) corresponds to radiation of extremely short 
wavelength, shorter than the hard X-ray region. 

Typically, a gamma ray is emitted when an $\alpha$ or $\beta$ decay results in a
daughter nucleus in an excited state. This then returns to the ground
state by a single photon transition or successive transitions involving
more than one photon. A familiar example is the successive emission of
gamma rays of energies 1.17 MeV and 1.33 MeV from the deexcitation of
$^{60}_{28}\mathrm{Ni}$ nuclei formed from $\beta^-$ decay of $^{60}_{27}\mathrm{Co}$.


Figure 13.4 $\beta^-$ decay of $^{60}_{27}\mathrm{Co}$ nucleus followed by emission of two $\gamma$ rays ($E_\gamma$ = 1.17 MeV and $E_\gamma$ = 1.33 MeV) from deexcitation of the daughter nucleus $^{60}_{28}\mathrm{Ni}$.


13.7 Nuclear Energy 

The curve of binding energy per nucleon $E_{bn}$, given in Fig. 13.1, has a
long flat middle region between A = 30 and A = 170. In this region the
binding energy per nucleon is nearly constant (8.0 MeV). For the lighter
nuclei region, A < 30, and for the heavier nuclei region, A > 170, the
binding energy per nucleon is less than 8.0 MeV, as we have noted
earlier. Now, the greater the binding energy, the less is the total mass of a
bound system, such as a nucleus. Consequently, if nuclei with less total
binding energy transform to nuclei with greater binding energy, there will
be a net energy release. This is what happens when a heavy nucleus
decays into two or more intermediate mass fragments (fission) or when
light nuclei fuse into a heavier nucleus (fusion).

Exothermic chemical reactions underlie conventional energy sources
such as coal or petroleum. Here the energies involved are in the range of
electron volts. On the other hand, in a nuclear reaction, the energy
release is of the order of MeV. Thus for the same quantity of matter,
nuclear sources produce a million times more energy than a chemical
source. Fission of 1 kg of uranium, for example, generates $10^{14}$ J of
energy; compare it with burning of 1 kg of coal that gives $10^{7}$ J.


13.7.1 Fission 


New possibilities emerge when we go beyond natural radioactive decays 
and study nuclear reactions by bombarding nuclei with other nuclear 
particles such as proton, neutron, a-particle, etc. 


A most important neutron-induced nuclear reaction is fission. An example
of fission is when a uranium isotope $^{235}_{92}\mathrm{U}$ bombarded with a neutron
breaks into two intermediate mass nuclear fragments

(13.26)


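The same reaction can produce other pairs of intermediate mass
fragments

$^{1}_{0}n + {}^{235}_{92}\mathrm{U} \rightarrow {}^{236}_{92}\mathrm{U} \rightarrow {}^{133}_{51}\mathrm{Sb} + {}^{99}_{41}\mathrm{Nb} + 4\,{}^{1}_{0}n$ (13.27)

Or, as another example,

$^{1}_{0}n + {}^{235}_{92}\mathrm{U} \rightarrow {}^{140}_{54}\mathrm{Xe} + {}^{94}_{38}\mathrm{Sr} + 2\,{}^{1}_{0}n$ (13.28)

The fragment products are radioactive nuclei; they emit $\beta$ particles in
succession to achieve stable end products.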

The energy released (the Q value) in the fission reaction of nuclei like
uranium is of the order of 200 MeV per fissioning nucleus. This is
estimated as follows:

Let us take a nucleus with A = 240 breaking into two fragments each of A = 120. Then

$E_{bn}$ for A = 240 nucleus is about 7.6 MeV,

$E_{bn}$ for the two A = 120 fragment nuclei is about 8.5 MeV.

Gain in binding energy per nucleon is about 0.9 MeV.

Hence the total gain in binding energy is 240 × 0.9 or 216 MeV.
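
The estimate above, and the figure of about $10^{14}$ J per kilogram quoted in Section 13.7, can be reproduced in a few lines. This is a rough sketch, not a precise calculation: it assumes that one mole of the A = 240 nuclide has a mass of about 240 g, and it uses the values of the Avogadro number and of 1 MeV in joules from the data given with the exercises.

```python
A_PARENT = 240          # mass number of the fissioning nucleus
EBN_PARENT = 7.6        # binding energy per nucleon of the parent, MeV
EBN_FRAGMENT = 8.5      # binding energy per nucleon of each A = 120 fragment, MeV

# All 240 nucleons end up in the more tightly bound fragments.
gain_per_nucleon = EBN_FRAGMENT - EBN_PARENT
q_fission = A_PARENT * gain_per_nucleon            # MeV per fission

MEV_TO_J = 1.6e-13      # 1 MeV in joules
AVOGADRO = 6.023e23     # nuclei per mole

# Assumption: molar mass in grams is approximately the mass number A.
nuclei_per_kg = 1000.0 / A_PARENT * AVOGADRO
energy_per_kg = nuclei_per_kg * q_fission * MEV_TO_J   # joules per kg

print(f"Q per fission    ~ {q_fission:.0f} MeV")
print(f"energy per 1 kg  ~ {energy_per_kg:.1e} J")
```

The result per kilogram is a little under $10^{14}$ J, consistent in order of magnitude with the value quoted for 1 kg of uranium.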

The disintegration energy in fission events first appears as the kinetic 
energy of the fragments and neutrons. Eventually it is transferred to the 
surrounding matter appearing as heat. The source of energy in nuclear 
reactors, which produce electricity, is nuclear fission. The enormous 
energy released in an atom bomb comes from uncontrolled nuclear 
fission. In the next section, we discuss in some detail how a nuclear
reactor functions.

13.7.2 Nuclear reactor 

Notice one fact of great importance in the fission reactions given in Eqs. 
(13.26) to (13.28). There is a release of extra neutron(s) in the fission
process. On average, 2½ neutrons are released per fission of a uranium
nucleus. It is a fraction since in some fission events 2 neutrons are
produced, in some 3, etc. The extra neutrons in turn can initiate fission 
processes, producing still more neutrons, and so on. This leads to the 
possibility of a chain reaction, as was first suggested by Enrico Fermi. If 
the chain reaction is controlled suitably, we can get a steady energy 
output. This is what happens in a nuclear reactor. If the chain reaction is 
uncontrolled, it leads to explosive energy output, as in a nuclear bomb. 


India’s atomic energy programme 


The atomic energy programme in India was launched around the time of independence under 
the leadership of Homi J. Bhabha (1909-1966). An early historic achievement was the design 
and construction of the first nuclear reactor in India (named Apsara) which went critical on 
August 4, 1956. It used enriched uranium as fuel and water as moderator. Following this was 
another notable landmark: the construction of CIRUS (Canada India Research U.S.) reactor 
in 1960. This 40 MW reactor used natural uranium as fuel and heavy water as moderator. 
Apsara and CIRUS spurred research in a wide range of areas of basic and applied nuclear 
science. An important milestone in the first two decades of the programme was the 
indigenous design and construction of the plutonium plant at Trombay, which ushered in the 
technology of fuel reprocessing (separating useful fissile and fertile nuclear materials from the 
spent fuel of a reactor) in India. Research reactors that have been subsequently 
commissioned include ZERLINA, PURNIMA (I, II and III), DHRUVA and KAMINI. KAMINI is 
the country’s first large research reactor that uses U-233 as fuel. As the name suggests, the 
primary objective of a research reactor is not generation of power but to provide a facility for 
research on different aspects of nuclear science and technology. Research reactors are also 
an excellent source for production of a variety of radioactive isotopes that find application in 
diverse fields: industry, medicine and agriculture. 

The main objectives of the Indian Atomic Energy programme are to provide safe and reliable 
electric power for the country’s social and economic progress and to be self-reliant in all 
aspects of nuclear technology. Exploration of atomic minerals in India undertaken since the 
early fifties has indicated that India has limited reserves of uranium, but fairly abundant 
reserves of thorium. Accordingly, our country has adopted a three-stage strategy of nuclear 
power generation. The first stage involves the use of natural uranium as a fuel, with heavy 
water as moderator. The Plutonium-239 obtained from reprocessing of the discharged fuel 
from the reactors then serves as a fuel for the second stage — the fast breeder reactors. 
They are so called because they use fast neutrons for sustaining the chain reaction (hence 
no moderator is needed) and, besides generating power, also breed more fissile species 
(plutonium) than they consume. The third stage, most significant in the long term, involves
using fast breeder reactors to produce fissile Uranium-233 from Thorium-232 and to build
power reactors based on them.

India is currently well into the second stage of the programme and considerable work has 
also been done on the third — the thorium utilisation — stage. The country has mastered the 
complex technologies of mineral exploration and mining, fuel fabrication, heavy water 
production, reactor design, construction and operation, fuel reprocessing, etc. Pressurised 
Heavy Water Reactors (PHWRs) built at different sites in the country mark the 
accomplishment of the first stage of the programme. India is now more than self-sufficient in 
heavy water production. Elaborate safety measures both in the design and operation of 
reactors, as also adhering to stringent standards of radiological protection are the hallmark of 
the Indian Atomic Energy Programme. 


There is, however, a hurdle in sustaining a chain reaction, as described 
here. It is known experimentally that slow neutrons (thermal neutrons) 
are much more likely to cause fission in $^{235}_{92}\mathrm{U}$ than fast neutrons. Also,
fast neutrons liberated in fission would escape instead of causing another
fission reaction.

The average energy of a neutron produced in fission of $^{235}_{92}\mathrm{U}$ is 2 MeV.
These neutrons, if not slowed down, will escape from the reactor without
interacting with the uranium nuclei, unless a very large amount of
fissionable material is used for sustaining the chain reaction. What one 
needs to do is to slow down the fast neutrons by elastic scattering with 
light nuclei. In fact, Chadwick’s experiments showed that in an elastic 
collision with hydrogen the neutron almost comes to rest and proton 
carries away the energy. This is the same situation as when a marble hits 
head-on an identical marble at rest. Therefore, in reactors, light nuclei 
called moderators are provided along with the fissionable nuclei for 
slowing down fast neutrons. The moderators commonly used are water,
heavy water ($\mathrm{D_2O}$) and graphite. The Apsara reactor at the Bhabha
Atomic Research Centre (BARC), Mumbai, uses water as moderator. The
other Indian reactors, which are used for power production, use heavy
water as moderator.
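
Why light nuclei make good moderators can be illustrated with the standard result for a head-on elastic collision with a target at rest: the fraction of the projectile's kinetic energy handed to the target is $4 m_1 m_2/(m_1 + m_2)^2$. The sketch below is only illustrative. It assumes head-on collisions and approximates each mass by its mass number; real moderation involves collisions at all angles and also depends on how strongly the moderator absorbs neutrons.

```python
def energy_fraction_transferred(m_projectile, m_target):
    """Fraction of kinetic energy given to a target at rest in a
    head-on (one-dimensional) elastic collision."""
    return 4.0 * m_projectile * m_target / (m_projectile + m_target) ** 2


NEUTRON = 1  # mass in units of the nucleon mass (approximation)

# Target masses approximated by mass numbers (assumption).
targets = {
    "hydrogen (A = 1)": 1,
    "deuterium (A = 2)": 2,
    "carbon (A = 12)": 12,
    "uranium (A = 238)": 238,
}

for name, mass in targets.items():
    fraction = energy_fraction_transferred(NEUTRON, mass)
    print(f"{name:18s}: {100 * fraction:5.1f}% of the neutron's energy transferred")
```

For equal masses the fraction is exactly 1, which is the marble-on-marble situation described above; for a uranium nucleus it is below 2%, so collisions with the fuel itself slow the neutrons very little.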


Because of the use of a moderator, it is possible that the ratio, K, of the
number of fissions produced by a given generation of neutrons to the
number of fissions of the preceding generation may be greater than one.
This ratio is called the multiplication factor; it is the measure of the growth
rate of the neutrons in the reactor. For K = 1, the operation of the reactor
is said to be critical, which is what we wish it to be for steady power
operation. If K becomes greater than one, the reaction rate and the
reactor power increase exponentially. Unless the factor K is brought
down very close to unity, the reactor will become supercritical and can 
even explode. The explosion of the Chernobyl reactor in Ukraine in 1986 
is a sad reminder that accidents in a nuclear reactor can be catastrophic. 
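
The role of K can be seen in a toy calculation. If each generation produces K times as many fissions as the one before, the fission rate after n generations is $K^n$ times the initial rate. The sketch below is a deliberately simplified model (it assumes a constant K and ignores delayed neutrons, feedback and the control systems of a real reactor); the function name is made up for this illustration.

```python
def relative_fission_rate(k, generations):
    """Fission rate after the given number of generations,
    relative to the initial rate, for a constant multiplication factor k."""
    return k ** generations


GENERATIONS = 1000

for k in (0.99, 1.00, 1.01):
    rate = relative_fission_rate(k, GENERATIONS)
    print(f"K = {k:.2f}: rate after {GENERATIONS} generations = {rate:.3g} x initial")
```

Even a 1% departure from K = 1 changes the rate by a factor of tens of thousands after a thousand generations, which is why K has to be held very close to unity for steady operation.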


The reaction rate is controlled through control rods made out of neutron-absorbing
material such as cadmium. In addition to control rods, reactors
are provided with safety rods which, when required, can be inserted into 
the reactor and K can be reduced rapidly to less than unity. 

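The more abundant isotope $^{238}_{92}\mathrm{U}$ in naturally occurring uranium is non-fissionable.
When it captures a neutron, it produces the highly radioactive
plutonium through these reactions

$^{238}_{92}\mathrm{U} + n \rightarrow {}^{239}_{92}\mathrm{U} \rightarrow {}^{239}_{93}\mathrm{Np} + e^- + \bar{\nu}$

$^{239}_{93}\mathrm{Np} \rightarrow {}^{239}_{94}\mathrm{Pu} + e^- + \bar{\nu}$

Plutonium undergoes fission with slow neutrons.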

Figure 13.5 shows the schematic diagram of a nuclear reactor based on
thermal neutron fission. The core of the reactor is the site of nuclear
fission. It contains the fuel elements in suitably fabricated form. The fuel
may be, say, enriched uranium (i.e., one that has greater abundance of
$^{235}_{92}\mathrm{U}$ than naturally occurring uranium). The core contains a moderator
to slow down the neutrons. The core is surrounded by a reflector to
reduce leakage. The energy (heat) released in fission is continuously
removed by a suitable coolant. A containment vessel prevents the
escape of radioactive fission products. The whole assembly is shielded to
check harmful radiation from coming out. The reactor can be shut down
by means of rods (made of, for example, cadmium) that have high
absorption of neutrons. The coolant transfers heat to a working fluid
which in turn may produce steam. The steam drives turbines and
generates electricity.





Figure 13.5 Schematic diagram of a nuclear reactor based on thermal neutron fission. 

Like any power reactor, nuclear reactors generate considerable waste 
products. But nuclear wastes need special care for treatment since they 
are radioactive and hazardous. Elaborate safety measures, both for 
reactor operation as well as handling and reprocessing the spent fuel, are 
required. These safety measures are a distinguishing feature of the 
Indian Atomic Energy programme. An appropriate plan is being evolved
to study the possibility of converting radioactive waste into less active
and short-lived material. 

13.7.3 Nuclear fusion - energy generation in stars 

When two light nuclei fuse to form a larger nucleus, energy is released, 
since the larger nucleus is more tightly bound, as seen from the binding 
energy curve in Fig. 13.1. Some examples of such energy liberating 
nuclear fusion reactions are : 

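$^{1}_{1}\mathrm{H} + {}^{1}_{1}\mathrm{H} \rightarrow {}^{2}_{1}\mathrm{H} + e^+ + \nu + 0.42$ MeV [13.29(a)]

$^{2}_{1}\mathrm{H} + {}^{2}_{1}\mathrm{H} \rightarrow {}^{3}_{2}\mathrm{He} + n + 3.27$ MeV [13.29(b)]

$^{2}_{1}\mathrm{H} + {}^{2}_{1}\mathrm{H} \rightarrow {}^{3}_{1}\mathrm{H} + {}^{1}_{1}\mathrm{H} + 4.03$ MeV [13.29(c)]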

In the first reaction, two protons combine to form a deuteron and a
positron with a release of 0.42 MeV energy. In reaction [13.29(b)], two
deuterons combine to form the light isotope of helium. In reaction
[13.29(c)], two deuterons combine to form a triton and a proton. For fusion
to take place, the two nuclei must come close enough so that the attractive
short-range nuclear force is able to affect them. However, since they are
both positively charged particles, they experience coulomb repulsion.
They, therefore, must have enough energy to overcome this coulomb
barrier. The height of the barrier depends on the charges and radii of the
two interacting nuclei. It can be shown, for example, that the barrier
height for two protons is ~ 400 keV, and is higher for nuclei with higher
charges. We can estimate the temperature at which two protons in a
proton gas would (on average) have enough energy to overcome the
coulomb barrier:

$(3/2)\,kT = K \simeq 400$ keV, which gives $T \sim 3 \times 10^{9}$ K.
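
The temperature estimate can be checked numerically. The sketch below uses the Boltzmann constant and the charge-to-joule conversion from the data given with the exercises. It also evaluates the mean kinetic energy at the temperature of the solar interior, $1.5 \times 10^{7}$ K, quoted later in this section. The variable names are illustrative.

```python
BOLTZMANN = 1.381e-23      # J/K
EV_TO_J = 1.6e-19          # 1 eV in joules

barrier_kev = 400.0                          # coulomb barrier for two protons, keV
barrier_j = barrier_kev * 1e3 * EV_TO_J      # same barrier in joules

# Mean kinetic energy (3/2) k T set equal to the barrier height.
temperature = 2.0 * barrier_j / (3.0 * BOLTZMANN)

# Mean kinetic energy of a particle at the temperature of the sun's interior.
T_SUN_INTERIOR = 1.5e7                       # K
mean_ke_sun_kev = 1.5 * BOLTZMANN * T_SUN_INTERIOR / EV_TO_J / 1e3

print(f"T needed for mean energy = barrier: {temperature:.2e} K")
print(f"mean kinetic energy in the sun's interior: {mean_ke_sun_kev:.2f} keV")
```

The mean kinetic energy in the sun's interior is only about 2 keV, roughly two hundred times smaller than the 400 keV barrier.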


When fusion is achieved by raising the temperature of the system so that 
particles have enough kinetic energy to overcome the coulomb repulsive 
behaviour, it is called thermonuclear fusion. 

Thermonuclear fusion is the source of energy output in the interior of
stars. The interior of the sun has a temperature of $1.5 \times 10^{7}$ K, which is
considerably less than the estimated temperature required for fusion of 
particles of average energy. Clearly, fusion in the sun involves protons 
whose energies are much above the average energy. 

The fusion reaction in the sun is a multi-step process in which the 
hydrogen is burned into helium. Thus, the fuel in the sun is the hydrogen 
in its core. The proton-proton (p, p) cycle by which this occurs is 
represented by the following sets of reactions: 

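$^{1}_{1}\mathrm{H} + {}^{1}_{1}\mathrm{H} \rightarrow {}^{2}_{1}\mathrm{H} + e^+ + \nu + 0.42$ MeV (i)

$e^+ + e^- \rightarrow \gamma + \gamma + 1.02$ MeV (ii)

$^{2}_{1}\mathrm{H} + {}^{1}_{1}\mathrm{H} \rightarrow {}^{3}_{2}\mathrm{He} + \gamma + 5.49$ MeV (iii)

$^{3}_{2}\mathrm{He} + {}^{3}_{2}\mathrm{He} \rightarrow {}^{4}_{2}\mathrm{He} + {}^{1}_{1}\mathrm{H} + {}^{1}_{1}\mathrm{H} + 12.86$ MeV (iv) (13.30)

For the fourth reaction to occur, the first three reactions must occur twice,
in which case two light helium nuclei unite to form an ordinary helium
nucleus. If we consider the combination 2(i) + 2(ii) + 2(iii) + (iv), the net
effect is

$4\,{}^{1}_{1}\mathrm{H} + 2e^- \rightarrow {}^{4}_{2}\mathrm{He} + 2\nu + 6\gamma + 26.7$ MeV

or $(4\,{}^{1}_{1}\mathrm{H} + 4e^-) \rightarrow ({}^{4}_{2}\mathrm{He} + 2e^-) + 2\nu + 6\gamma + 26.7$ MeV (13.31)

Thus, four hydrogen atoms combine to form a $^{4}_{2}\mathrm{He}$ atom with a release
of 26.7 MeV of energy.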
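
The net energy release follows from adding up the four steps with the multiplicities 2, 2, 2 and 1. A minimal sketch of that bookkeeping, with the step energies taken from the proton-proton cycle above and the dictionary names chosen only for illustration:

```python
# Energy released in each step of the proton-proton cycle, in MeV.
STEP_ENERGY = {"i": 0.42, "ii": 1.02, "iii": 5.49, "iv": 12.86}

# Steps (i)-(iii) must each occur twice for step (iv) to occur once.
MULTIPLICITY = {"i": 2, "ii": 2, "iii": 2, "iv": 1}

total_energy = sum(MULTIPLICITY[step] * STEP_ENERGY[step] for step in STEP_ENERGY)

print(f"net energy per helium-4 nucleus formed: {total_energy:.2f} MeV")
```

The sum is 26.72 MeV, which rounds to the 26.7 MeV quoted in Eq. (13.31).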



Helium is not the only element that can be synthesized in the interior of a 
star. As the hydrogen in the core gets depleted and becomes helium, the 
core starts to cool. The star begins to collapse under its own gravity 
which increases the temperature of the core. If this temperature
increases to about $10^{8}$ K, fusion takes place again, this time of helium
nuclei into carbon. This kind of process can generate through fusion 
higher and higher mass number elements. But elements more massive 
than those near the peak of the binding energy curve in Fig. 13.1 cannot 
be so produced. 

The age of the sun is about $5 \times 10^{9}$ y and it is estimated that there is
enough hydrogen in the sun to keep it going for another 5 billion years.
After that, the hydrogen burning will stop and the sun will begin to cool
and will start to collapse under gravity, which will raise the core
temperature. The outer envelope of the sun will expand, turning it into the
so-called red giant.


Nuclear holocaust 

In a single uranium fission about 0.9 × 235 MeV (≈ 200 MeV) of energy is liberated. If each
nucleus of about 50 kg of $^{235}_{92}\mathrm{U}$ undergoes fission the amount of energy involved is about
$4 \times 10^{15}$ J. This energy is equivalent to about 20,000 tons of TNT, enough for a superexplosion.
Uncontrolled release of large nuclear energy is called an atomic explosion. On August 6,
1945 an atomic device was used in warfare for the first time. The US dropped an atom bomb
on Hiroshima, Japan. The explosion was equivalent to 20,000 tons of TNT. Instantly the
radioactive products devastated 10 sq km of the city which had 3,43,000 inhabitants. Of this 
number 66,000 were killed and 69,000 were injured; more than 67% of the city’s structures 
were destroyed. 


High temperature conditions for fusion reactions can be created by exploding a fission bomb. 


Super-explosions equivalent to 10 megatons of explosive power of TNT were tested in 1954. 
Such bombs, which involve fusion of isotopes of hydrogen, deuterium and tritium, are called
hydrogen bombs. It is estimated that a nuclear arsenal sufficient to destroy every form of life
on this planet several times over is in position to be triggered by the press of a button. Such a
nuclear holocaust will not only destroy the life that exists now but its radioactive fallout will
make this planet unfit for life for all time. Scenarios based on theoretical calculations predict
a long nuclear winter, as the radioactive waste will hang like a cloud in the earth’s 
atmosphere and will absorb the sun’s radiation. 


13.7.4 Controlled thermonuclear fusion 

The natural thermonuclear fusion process in a star is replicated in a 
thermonuclear fusion device. In controlled fusion reactors, the aim is to 
generate steady power by heating the nuclear fuel to a temperature in the
range of $10^{8}$ K. At these temperatures, the fuel is a mixture of positive
ions and electrons (plasma). The challenge is to confine this plasma, 
since no container can stand such a high temperature. Several countries 
around the world including India are developing techniques in this 
connection. If successful, fusion reactors will hopefully supply almost 
unlimited power to humanity. 


Example 13.7 Answer the following questions: 

(a) Are the equations of nuclear reactions (such as those given in Section 13.7) ‘balanced’ in 
the sense a chemical equation (e.g., $2\mathrm{H_2} + \mathrm{O_2} \rightarrow 2\mathrm{H_2O}$) is? If not, in what sense are they
balanced on both sides? 

(b) If both the number of protons and the number of neutrons are conserved in each nuclear 
reaction, in what way is mass converted into energy (or vice-versa) in a nuclear reaction? 




(c) A general impression exists that mass-energy interconversion takes place only in nuclear 
reaction and never in chemical reaction. This is strictly speaking, incorrect. Explain. 

Solution 

(a) A chemical equation is balanced in the sense that the number of atoms of each element is 
the same on both sides of the equation. A chemical reaction merely alters the original 
combinations of atoms. In a nuclear reaction, elements may be transmuted. Thus, the 
number of atoms of each element is not necessarily conserved in a nuclear reaction. 
However, the number of protons and the number of neutrons are both separately conserved 
in a nuclear reaction. [Actually, even this is not strictly true in the realm of very high energies 
- what is strictly conserved is the total charge and total ‘baryon number’. We need not pursue 
this matter here.] 

In nuclear reactions (e.g., Eq. 13.26), the number of protons and the number of neutrons are 
the same on the two sides of the equation. 

(b) We know that the binding energy of a nucleus gives a negative contribution to the mass of 
the nucleus (mass defect). Now, since proton number and neutron number are conserved in 
a nuclear reaction, the total rest mass of neutrons and protons is the same on either side of a 
reaction. But the total binding energy of nuclei on the left side need not be the same as that 
on the right hand side. The difference in these binding energies appears as energy released 
or absorbed in a nuclear reaction. Since binding energy contributes to mass, we say that the 
difference in the total mass of nuclei on the two sides gets converted into energy or vice-versa.
It is in this sense that a nuclear reaction is an example of mass-energy interconversion.

(c) From the point of view of mass-energy interconversion, a chemical reaction is similar to a 
nuclear reaction in principle. The energy released or absorbed in a chemical reaction can be 
traced to the difference in chemical (not nuclear) binding energies of atoms and molecules on 
the two sides of a reaction. Since, strictly speaking, chemical binding energy also gives a 
negative contribution (mass defect) to the total mass of an atom or molecule, we can equally 
well say that the difference in the total mass of atoms or molecules, on the two sides of the 
chemical reaction gets converted into energy or vice-versa. However, the mass defects
involved in a chemical reaction are almost a million times smaller than those in a nuclear
reaction. This is the reason for the general impression (which is incorrect) that mass-energy
interconversion does not take place in a chemical reaction.


Summary 

1. An atom has a nucleus. The nucleus is positively charged. The radius of the nucleus is
smaller than the radius of an atom by a factor of $10^{4}$. More than 99.9% mass of the atom is
concentrated in the nucleus.

2. On the atomic scale, mass is measured in atomic mass units (u). By definition, 1 atomic
mass unit (1 u) is 1/12th of the mass of one atom of $^{12}\mathrm{C}$;

1 u = $1.660563 \times 10^{-27}$ kg.

3. A nucleus contains a neutral particle called the neutron. Its mass is almost the same as that of
the proton.

4. The atomic number Z is the number of protons in the atomic nucleus of an element. The 
mass number A is the total number of protons and neutrons in the atomic nucleus; A = Z+N; 
Here N denotes the number of neutrons in the nucleus. 

A nuclear species or a nuclide is represented as $^{A}_{Z}\mathrm{X}$, where X is the chemical symbol of the 
species. 

Nuclides with the same atomic number Z, but different neutron number N are called isotopes. 
Nuclides with the same A are isobars and those with the same N are isotones. 

Most elements are mixtures of two or more isotopes. The atomic mass of an element is a
weighted average of the masses of its isotopes. The weights are the relative abundances of
the isotopes.

5. A nucleus can be considered to be spherical in shape and assigned a radius. Electron
scattering experiments allow determination of the nuclear radius; it is found that radii of nuclei
fit the formula

$R = R_0 A^{1/3}$,

where $R_0$ = a constant = 1.2 fm. This implies that the nuclear density is independent of A. It is
of the order of $10^{17}$ kg/m$^3$.

6. Neutrons and protons are bound in a nucleus by the short-range strong nuclear force. The 
nuclear force does not distinguish between neutron and proton. 

7. The nuclear mass M is always less than the total mass, $\Sigma m$, of its constituents. The
difference in mass of a nucleus and its constituents is called the mass defect,

$\Delta M = (Z m_p + (A - Z) m_n) - M$

Using Einstein’s mass energy relation, we express this mass difference in terms of energy as

$\Delta E_b = \Delta M c^2$

The energy $\Delta E_b$ represents the binding energy of the nucleus. In the mass number range A =
30 to 170, the binding energy per nucleon is nearly constant, about 8 MeV/nucleon.

8. Energies associated with nuclear processes are about a million times larger than those of chemical
processes.

9. The Q-value of a nuclear process is

Q = final kinetic energy - initial kinetic energy.

Due to conservation of mass-energy, this is also

$Q = (\text{sum of initial masses} - \text{sum of final masses})\,c^2$

10. Radioactivity is the phenomenon in which nuclei of a given species transform by giving
out $\alpha$ or $\beta$ or $\gamma$ rays; $\alpha$-rays are helium nuclei;
$\beta$-rays are electrons; $\gamma$-rays are electromagnetic radiation of wavelengths shorter than X-rays.

11. Law of radioactive decay: $N(t) = N(0)\,e^{-\lambda t}$

where $\lambda$ is the decay constant or disintegration constant.

The half-life $T_{1/2}$ of a radionuclide is the time in which N has been reduced to one-half of its
initial value. The mean life $\tau$ is the time at which N has been reduced to $e^{-1}$ of its initial value.

$T_{1/2} = \dfrac{\ln 2}{\lambda} = \tau \ln 2$

12. Energy is released when less tightly bound nuclei are transmuted into more tightly bound 
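nuclei. In fission, a heavy nucleus like $^{235}_{92}\mathrm{U}$ breaks into two smaller fragments, e.g.,

$^{235}_{92}\mathrm{U} + {}^{1}_{0}n \rightarrow {}^{133}_{51}\mathrm{Sb} + {}^{99}_{41}\mathrm{Nb} + 4\,{}^{1}_{0}n$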

13. The fact that more neutrons are produced in fission than are consumed gives the 
possibility of a chain reaction with each neutron that is produced triggering another fission. 
The chain reaction is uncontrolled and rapid in a nuclear bomb explosion. It is controlled and 
steady in a nuclear reactor. In a reactor, the value of the neutron multiplication factor K is 
maintained at 1. 

14. In fusion, lighter nuclei combine to form a larger nucleus. Fusion of hydrogen nuclei into 
helium nuclei is the source of energy of all stars including our sun. 


Physical Quantity | Symbol | Dimensions | Units | Remarks
Atomic mass unit | | [M] | u | Unit of mass for expressing atomic or nuclear masses. One atomic mass unit equals 1/12th of the mass of a $^{12}\mathrm{C}$ atom.
Disintegration or decay constant | $\lambda$ | [T$^{-1}$] | s$^{-1}$ |
Half-life | $T_{1/2}$ | [T] | s | Time taken for the decay of one-half of the initial number of nuclei present in a radioactive sample.
Mean life | $\tau$ | [T] | s | Time at which number of nuclei has been reduced to $e^{-1}$ of its initial value.
Activity of a radioactive sample | R | [T$^{-1}$] | Bq | Measure of the activity of a radioactive source.







Points to Ponder 

1. The density of nuclear matter is independent of the size of the nucleus. The mass density 
of the atom does not follow this rule. 

2. The radius of a nucleus determined by electron scattering is found to be slightly different 
from that determined by alpha-particle scattering. This is because electron scattering senses 
the charge distribution of the nucleus, whereas alpha and similar particles sense the nuclear 
matter. 

3. After Einstein showed the equivalence of mass and energy, $E = mc^2$, we cannot any longer
speak of separate laws of conservation of mass and conservation of energy, but we have to 
speak of a unified law of conservation of mass and energy. The most convincing evidence 
that this principle operates in nature comes from nuclear physics. It is central to our 
understanding of nuclear energy and harnessing it as a source of power. Using the principle, 
Q of a nuclear process (decay or reaction) can be expressed also in terms of initial and final 
masses. 

4. The nature of the binding energy (per nucleon) curve shows that exothermic nuclear 
reactions are possible, when two light nuclei fuse or when a heavy nucleus undergoes fission 
into nuclei with intermediate mass. 

5. For fusion, the light nuclei must have sufficient initial energy to overcome the coulomb 
potential barrier. That is why fusion requires very high temperatures. 

6. Although the binding energy (per nucleon) curve is smooth and slowly varying, it shows
peaks at nuclides like $^{4}_{2}\mathrm{He}$, etc. This is considered as evidence of atom-like shell
structure in nuclei.

7. Electron and positron are a particle-antiparticle pair. They are identical in mass; their 
charges are equal in magnitude and opposite. (It is found that when an electron and a 
positron come together, they annihilate each other giving energy in the form of gamma-ray 
photons.) 




8. In $\beta^-$-decay (electron emission), the particle emitted along with the electron is the anti-neutrino ($\bar{\nu}$).
On the other hand, the particle emitted in $\beta^+$-decay (positron emission) is the neutrino ($\nu$).
Neutrino and anti-neutrino are a particle-antiparticle pair. There are antiparticles associated
with every particle. What should be the antiproton, which is the antiparticle of the proton?

9. A free neutron is unstable ($n \rightarrow p + e^- + \bar{\nu}$). But a similar free proton decay is not
possible, since a proton is (slightly) lighter than a neutron.

10. Gamma emission usually follows alpha or beta emission. A nucleus in an excited (higher) 
state goes to a lower state by emitting a gamma photon. A nucleus may be left in an excited 
state after alpha or beta emission. Successive emission of gamma rays from the same
nucleus (as in case of $^{60}_{28}\mathrm{Ni}$, Fig. 13.4) is a clear proof that nuclei also have discrete energy
levels as do the atoms.

11. Radioactivity is an indication of the instability of nuclei. Stability requires the ratio of 
neutron to proton to be around 1:1 for light nuclei. This ratio increases to about 3:2 for heavy 
nuclei. (More neutrons are required to overcome the effect of repulsion among the protons.) 
Nuclei which are away from the stability ratio, i.e., nuclei which have an excess of neutrons or 
protons are unstable. In fact, only about 10% of known isotopes (of all elements) are stable. 
Others have been either artificially produced in the laboratory by bombarding a, p, d, n or 
other particles on targets of stable nuclear species or identified in astronomical observations 
of matter in the universe. 



Exercises 

You may find the following data useful in solving the exercises: 
$e = 1.6 \times 10^{-19}$ C
$N = 6.023 \times 10^{23}$ per mole
$1/(4\pi\varepsilon_0) = 9 \times 10^{9}$ N m$^2$/C$^2$
$k = 1.381 \times 10^{-23}$ J K$^{-1}$
1 MeV = $1.6 \times 10^{-13}$ J
1 u = 931.5 MeV/$c^2$
1 year = $3.154 \times 10^{7}$ s
$m_{\mathrm{H}}$ = 1.007825 u
$m_n$ = 1.008665 u
$m(^{4}_{2}\mathrm{He})$ = 4.002603 u
$m_e$ = 0.000548 u

13.1 (a) Two stable isotopes of lithium, $^{6}_{3}\mathrm{Li}$ and $^{7}_{3}\mathrm{Li}$, have respective
abundances of 7.5% and 92.5%. These isotopes have masses
6.01512 u and 7.01600 u, respectively. Find the atomic mass of
lithium.

(b) Boron has two stable isotopes, $^{10}_{5}\mathrm{B}$ and $^{11}_{5}\mathrm{B}$. Their respective
masses are 10.01294 u and 11.00931 u, and the atomic mass of
boron is 10.811 u. Find the abundances of $^{10}_{5}\mathrm{B}$ and $^{11}_{5}\mathrm{B}$.

13.2 The three stable isotopes of neon, $^{20}_{10}$Ne, $^{21}_{10}$Ne and $^{22}_{10}$Ne,
have respective abundances of 90.51%, 0.27% and 9.22%. The
atomic masses of the three isotopes are 19.99 u, 20.99 u and 21.99 
u, respectively. Obtain the average atomic mass of neon. 
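
The abundance-weighted average used in Exercises 13.1 and 13.2 can be sketched in a few lines of Python. This is only an illustration under the assumption that abundances are given in percent and sum to 100; the function name `average_atomic_mass` is hypothetical and not part of the text.

```python
def average_atomic_mass(isotopes):
    """Abundance-weighted mean mass in u.

    `isotopes` is a list of (mass_in_u, abundance_in_percent) pairs.
    """
    total_abundance = sum(abundance for _, abundance in isotopes)
    weighted_sum = sum(mass * abundance for mass, abundance in isotopes)
    return weighted_sum / total_abundance

# Data quoted in Exercises 13.1(a) and 13.2.
lithium = [(6.01512, 7.5), (7.01600, 92.5)]
neon = [(19.99, 90.51), (20.99, 0.27), (21.99, 9.22)]

print(f"lithium: {average_atomic_mass(lithium):.3f} u")
print(f"neon:    {average_atomic_mass(neon):.2f} u")
```

Dividing by the total abundance keeps the function usable even if the percentages are rounded and do not add up to exactly 100.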

13.3 Obtain the binding energy (in MeV) of a nitrogen nucleus ($^{14}_{7}$N),
given m($^{14}_{7}$N) = 14.00307 u





13.4 Obtain the binding energy of the nuclei $^{56}_{26}$Fe and $^{209}_{83}$Bi in units
of MeV from the following data:

m($^{56}_{26}$Fe) = 55.934939 u, m($^{209}_{83}$Bi) = 208.980388 u

13.5 A given coin has a mass of 3.0 g. Calculate the nuclear energy 
that would be required to separate all the neutrons and protons from 
each other. For simplicity assume that the coin is entirely made of 
$^{63}_{29}$Cu atoms (of mass 62.92960 u).

13.6 Write nuclear reaction equations for
(i) α-decay of $^{226}_{88}$Ra (ii) α-decay of $^{242}_{94}$Pu
(iii) β⁻-decay of $^{32}_{15}$P (iv) β⁻-decay of $^{210}_{83}$Bi
(v) β⁺-decay of $^{11}_{6}$C (vi) β⁺-decay of $^{97}_{43}$Tc
(vii) Electron capture of $^{120}_{54}$Xe

13.7 A radioactive isotope has a half-life of T years. How long will it 
take the activity to reduce to a) 3.125%, b) 1% of its original value? 
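
The time asked for in Exercise 13.7 follows from the exponential decay law, $R = R_0 (1/2)^{t/T}$, so that $t = T \log_2(R_0/R)$. A minimal Python sketch is given below; the function name `time_to_fraction` is an illustrative choice, and the half-life is set to 1 so that the result comes out in units of T.

```python
import math

def time_to_fraction(half_life, fraction):
    """Time for the activity to fall to `fraction` of its initial value.

    Uses R = R0 * (1/2)**(t / half_life), i.e. t = half_life * log2(1 / fraction).
    The result is in the same unit as `half_life`.
    """
    if not 0 < fraction <= 1:
        raise ValueError("fraction must lie in (0, 1]")
    return half_life * math.log(1 / fraction) / math.log(2)

# With half_life = 1 the answers are in units of the half-life T.
print(f"3.125%: {time_to_fraction(1.0, 0.03125):.3f} T")
print(f"1%:     {time_to_fraction(1.0, 0.01):.3f} T")
```

The same function applies to any problem that gives a half-life and a ratio of activities, provided the activity is proportional to the number of undecayed nuclei.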

13.8 The normal activity of living carbon-containing matter is found to
be about 15 decays per minute for every gram of carbon. This activity
arises from the small proportion of radioactive $^{14}_{6}$C present with the
stable carbon isotope $^{12}_{6}$C. When the organism is dead, its
interaction with the atmosphere (which maintains the above
equilibrium activity) ceases and its activity begins to drop. From the
known half-life (5730 years) of $^{14}_{6}$C, and the measured activity, the
age of the specimen can be approximately estimated. This is the
principle of $^{14}_{6}$C dating used in archaeology. Suppose a specimen
from Mohenjodaro gives an activity of 9 decays per minute per gram
of carbon. Estimate the approximate age of the Indus-Valley
civilisation.


13.9 Obtain the amount of $^{60}_{27}$Co necessary to provide a radioactive
source of 8.0 mCi strength. The half-life of $^{60}_{27}$Co is 5.3 years.

13.10 The half-life of $^{90}_{38}$Sr is 28 years. What is the disintegration rate
of 15 mg of this isotope?

13.11 Obtain approximately the ratio of the nuclear radii of the gold
isotope $^{197}_{79}$Au and the silver isotope $^{107}_{47}$Ag.

13.12 Find the Q-value and the kinetic energy of the emitted α-particle
in the α-decay of (a) $^{226}_{88}$Ra and (b) $^{220}_{86}$Rn.

Given m($^{226}_{88}$Ra) = 226.02540 u, m($^{222}_{86}$Rn) = 222.01750 u,

m($^{220}_{86}$Rn) = 220.01137 u, m($^{216}_{84}$Po) = 216.00189 u.

13.13 The radionuclide $^{11}$C decays according to

$^{11}_{6}\mathrm{C} \to {}^{11}_{5}\mathrm{B} + e^{+} + \nu$; $T_{1/2} = 20.3$ min

The maximum energy of the emitted positron is 0.960 MeV.

Given the mass values:

m($^{11}_{6}$C) = 11.011434 u and m($^{11}_{5}$B) = 11.009305 u,

calculate Q and compare it with the maximum energy of the positron 
emitted. 

13.14 The nucleus $^{23}_{10}$Ne decays by β⁻ emission. Write down the β⁻
decay equation and determine the maximum kinetic energy of the
electrons emitted. Given that:

m($^{23}_{10}$Ne) = 22.994466 u
m($^{23}_{11}$Na) = 22.989770 u.




13.15 The Q value of a nuclear reaction A + b → C + d is defined
by $Q = [m_A + m_b - m_C - m_d]c^2$

where the masses refer to the respective nuclei. Determine from the
given data the Q-value of the following reactions and state whether
the reactions are exothermic or endothermic.

(i) $^{1}_{1}\mathrm{H} + {}^{3}_{1}\mathrm{H} \to {}^{2}_{1}\mathrm{H} + {}^{2}_{1}\mathrm{H}$

(ii) $^{12}_{6}\mathrm{C} + {}^{12}_{6}\mathrm{C} \to {}^{20}_{10}\mathrm{Ne} + {}^{4}_{2}\mathrm{He}$

Atomic masses are given to be
m($^{2}_{1}$H) = 2.014102 u
m($^{3}_{1}$H) = 3.016049 u
m($^{12}_{6}$C) = 12.000000 u
m($^{20}_{10}$Ne) = 19.992439 u
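
The definition of Q in Exercise 13.15 translates directly into a short calculation. The sketch below is illustrative only: it reads reaction (i) as ¹H + ³H → ²H + ²H and reaction (ii) as ¹²C + ¹²C → ²⁰Ne + ⁴He, uses atomic masses in place of nuclear masses (the electron masses cancel because the number of electrons is the same on both sides), and takes 1 u = 931.5 MeV/c² from the data table. The function names are hypothetical.

```python
U_TO_MEV = 931.5  # 1 u = 931.5 MeV/c^2, as given in the data table

def q_value_mev(reactant_masses_u, product_masses_u):
    """Q = (sum of reactant masses - sum of product masses) * c^2, in MeV."""
    return (sum(reactant_masses_u) - sum(product_masses_u)) * U_TO_MEV

def classify(q_mev):
    """A positive Q means energy is released (exothermic)."""
    return "exothermic" if q_mev > 0 else "endothermic"

# Atomic masses in u, taken from the exercise data.
M_H1, M_H2, M_H3 = 1.007825, 2.014102, 3.016049
M_HE4, M_C12, M_NE20 = 4.002603, 12.000000, 19.992439

q1 = q_value_mev([M_H1, M_H3], [M_H2, M_H2])       # reaction (i)
q2 = q_value_mev([M_C12, M_C12], [M_NE20, M_HE4])  # reaction (ii)

for label, q in (("(i)", q1), ("(ii)", q2)):
    print(f"{label} Q = {q:.3f} MeV, {classify(q)}")
```

The same helper can be reused for the decay and fission exercises that follow, since each of them reduces to a mass difference between the initial and final particles.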

13.16 Suppose, we think of fission of a $^{56}_{26}$Fe nucleus into two equal
fragments, $^{28}_{13}$Al. Is the fission energetically possible? Argue by
working out Q of the process. Given m($^{56}_{26}$Fe) = 55.93494 u and
m($^{28}_{13}$Al) = 27.98191 u.

13.17 The fission properties of $^{239}_{94}$Pu are very similar to those of
$^{235}_{92}$U. The average energy released per fission is 180 MeV. How
much energy, in MeV, is released if all the atoms in 1 kg of pure
$^{239}_{94}$Pu undergo fission?

13.18 A 1000 MW fission reactor consumes half of its fuel in 5.00 y.
How much $^{235}_{92}$U did it contain initially? Assume that the reactor
operates 80% of the time, that all the energy generated arises from
the fission of $^{235}_{92}$U and that this nuclide is consumed only by the
fission process.

13.19 How long can an electric lamp of 100 W be kept glowing by
fusion of 2.0 kg of deuterium? Take the fusion reaction as

$^{2}_{1}\mathrm{H} + {}^{2}_{1}\mathrm{H} \to {}^{3}_{2}\mathrm{He} + n + 3.27\ \mathrm{MeV}$

13.20 Calculate the height of the potential barrier for a head on 
collision of two deuterons. (Hint: The height of the potential barrier is 
given by the Coulomb repulsion between the two deuterons when 
they just touch each other. Assume that they can be taken as hard 
spheres of radius 2.0 fm.) 

13.21 From the relation $R = R_0 A^{1/3}$, where $R_0$ is a constant and A is
the mass number of a nucleus, show that the nuclear matter density
is nearly constant (i.e. independent of A).

13.22 For the β⁺ (positron) emission from a nucleus, there is another
competing process known as electron capture (electron from an inner
orbit, say, the K-shell, is captured by the nucleus and a neutrino is
emitted).

$e^{-} + {}^{A}_{Z}\mathrm{X} \to {}^{A}_{Z-1}\mathrm{Y} + \nu$

Show that if β⁺ emission is energetically allowed, electron capture is
necessarily allowed but not vice-versa.

Additional Exercises 

13.23 In a periodic table the average atomic mass of magnesium is
given as 24.312 u. The average value is based on their relative
natural abundance on earth. The three isotopes and their masses are
$^{24}_{12}$Mg (23.98504 u), $^{25}_{12}$Mg (24.98584 u) and $^{26}_{12}$Mg (25.98259 u). The
natural abundance of $^{24}_{12}$Mg is 78.99% by mass. Calculate the
abundances of other two isotopes.

13.24 The neutron separation energy is defined as the energy 
required to remove a neutron from the nucleus. Obtain the neutron 
separation energies of the nuclei $^{41}_{20}$Ca and $^{27}_{13}$Al from the following
data:

m($^{40}_{20}$Ca) = 39.962591 u
m($^{41}_{20}$Ca) = 40.962278 u
m($^{26}_{13}$Al) = 25.986895 u
m($^{27}_{13}$Al) = 26.981541 u

13.25 A source contains two phosphorus radionuclides: $^{32}_{15}$P ($T_{1/2}$ =
14.3 d) and $^{33}_{15}$P ($T_{1/2}$ = 25.3 d). Initially, 10% of the decays come from
$^{33}_{15}$P. How long must one wait until 90% do so?

13.26 Under certain circumstances, a nucleus can decay by emitting 
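a particle more massive than an α-particle. Consider the following
decay processes:

$^{223}_{88}\mathrm{Ra} \to {}^{209}_{82}\mathrm{Pb} + {}^{14}_{6}\mathrm{C}$

$^{223}_{88}\mathrm{Ra} \to {}^{219}_{86}\mathrm{Rn} + {}^{4}_{2}\mathrm{He}$

Calculate the Q-values for these decays and determine that both are
energetically allowed.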

13.27 Consider the fission of $^{238}_{92}$U by fast neutrons. In one fission
event, no neutrons are emitted and the final end products, after the
beta decay of the primary fragments, are $^{140}_{58}$Ce and $^{99}_{44}$Ru. Calculate
Q for this fission process. The relevant atomic and particle masses
are

m($^{238}_{92}$U) = 238.05079 u
m($^{140}_{58}$Ce) = 139.90543 u
m($^{99}_{44}$Ru) = 98.90594 u

13.28 Consider the D-T reaction (deuterium-tritium fusion) 

$^{2}_{1}\mathrm{H} + {}^{3}_{1}\mathrm{H} \to {}^{4}_{2}\mathrm{He} + n$

(a) Calculate the energy released in MeV in this reaction from the
data:

m($^{2}_{1}$H) = 2.014102 u
m($^{3}_{1}$H) = 3.016049 u

(b) Consider the radius of both deuterium and tritium to be 
approximately 2.0 fm. What is the kinetic energy needed to overcome 
the coulomb repulsion between the two nuclei? To what temperature 
must the gas be heated to initiate the reaction? 

(Hint: Kinetic energy required for one fusion event = average thermal
kinetic energy available with the interacting particles = 2(3kT/2); k =
Boltzmann's constant, T = absolute temperature.)

13.29 Obtain the maximum kinetic energy of β-particles, and the
radiation frequencies of γ decays in the decay scheme shown in Fig.
13.6. You are given that

m($^{198}$Au) = 197.968233 u
m($^{198}$Hg) = 197.966760 u








13.30 Calculate and compare the energy released by a) fusion of 1.0
kg of hydrogen deep within Sun and b) the fission of 1.0 kg of $^{235}$U in
a fission reactor.

13.31 Suppose India had a target of producing by 2020 AD, 200,000 
MW of electric power, ten percent of which was to be obtained from 
nuclear power plants. Suppose we are given that, on an average, the 
efficiency of utilization (i.e. conversion to electric energy) of thermal 
energy produced in a reactor was 25%. How much amount of 
fissionable uranium would our country need per year by 2020? Take 

the heat energy per fission of $^{235}$U to be about 200 MeV.
Chapter Fourteen


Semiconductor Electronics: 
Materials, devices and simple 
circuits 


14.1 Introduction 

Devices in which a controlled flow of electrons can be obtained are the 
basic building blocks of all the electronic circuits. Before the discovery of 
transistor in 1948, such devices were mostly vacuum tubes (also called 
valves) like the vacuum diode which has two electrodes, viz., anode 
(often called plate) and cathode; triode which has three electrodes - 
cathode, plate and grid; tetrode and pentode (respectively with 4 and 5 
electrodes). In a vacuum tube, the electrons are supplied by a heated 
cathode and the controlled flow of these electrons in vacuum is obtained 
by varying the voltage between its different electrodes. Vacuum is 
required in the inter-electrode space; otherwise the moving electrons may 
lose their energy on collision with the air molecules in their path. In these 
devices the electrons can flow only from the cathode to the anode (i.e., 
only in one direction). Therefore, such devices are generally referred to 
as valves. These vacuum tube devices are bulky, consume high power, 
operate generally at high voltages (~100 V) and have limited life and low
reliability. The seed of the development of modern solid-state
semiconductor electronics goes back to the 1930s, when it was realised that
some solid-state semiconductors and their junctions offer the possibility 
of controlling the number and the direction of flow of charge carriers 




through them. Simple excitations like light, heat or small applied voltage 
can change the number of mobile charges in a semiconductor. Note that 
the supply and flow of charge carriers in the semiconductor devices are 
within the solid itself, while in the earlier vacuum tubes/valves, the mobile 
electrons were obtained from a heated cathode and they were made to 
flow in an evacuated space or vacuum. No external heating or large 
evacuated space is required by the semiconductor devices. They are 
small in size, consume low power, operate at low voltages and have long 
life and high reliability. Even the Cathode Ray Tubes (CRT) used in 
television and computer monitors which work on the principle of vacuum 
tubes are being replaced by Liquid Crystal Display (LCD) monitors with 
supporting solid state electronics. Long before the full implications of
semiconductor devices were formally understood, a naturally occurring
crystal of galena (Lead sulphide, PbS) with a metal point contact
attached to it was used as a detector of radio waves.

In the following sections, we will introduce the basic concepts of 
semiconductor physics and discuss some semiconductor devices like 
junction diodes (a 2-electrode device) and bipolar junction transistor (a 3- 
electrode device). A few circuits illustrating their applications will also be 
described. 

14.2 Classification of Metals, Conductors and 
Semiconductors 

On the basis of conductivity 

On the basis of the relative values of electrical conductivity ($\sigma$) or
resistivity ($\rho = 1/\sigma$), the solids are broadly classified as:

(i) Metals: They possess very low resistivity (or high conductivity).
$\rho \sim 10^{-2}$ to $10^{-8}$ $\Omega$ m
$\sigma \sim 10^{2}$ to $10^{8}$ S m$^{-1}$

(ii) Semiconductors: They have resistivity or conductivity intermediate
to metals and insulators.
$\rho \sim 10^{-5}$ to $10^{6}$ $\Omega$ m
$\sigma \sim 10^{5}$ to $10^{-6}$ S m$^{-1}$

(iii) Insulators: They have high resistivity (or low conductivity).
$\rho \sim 10^{11}$ to $10^{19}$ $\Omega$ m
$\sigma \sim 10^{-11}$ to $10^{-19}$ S m$^{-1}$

The values of $\rho$ and $\sigma$ given above are indicative of magnitude and could
well go outside these ranges. Relative values of the resistivity are
not the only criteria for distinguishing metals, insulators and 
semiconductors from each other. There are some other differences, 
which will become clear as we go along in this chapter. 
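
The indicative ranges above can be turned into a small lookup, which also makes it visible why resistivity alone is not a sufficient criterion: the metal and semiconductor ranges overlap, and there is a gap between the semiconductor and insulator ranges. The following Python sketch is only an illustration; the function names and the sample resistivities are hypothetical and are not taken from the text.

```python
# Indicative resistivity ranges in ohm metre, as listed in the classification above.
RESISTIVITY_RANGES = {
    "metal": (1e-8, 1e-2),
    "semiconductor": (1e-5, 1e6),
    "insulator": (1e11, 1e19),
}

def candidate_classes(resistivity_ohm_m):
    """Return every class whose indicative range contains the resistivity."""
    return [
        name
        for name, (low, high) in RESISTIVITY_RANGES.items()
        if low <= resistivity_ohm_m <= high
    ]

def conductivity(resistivity_ohm_m):
    """sigma = 1 / rho, in S/m."""
    return 1.0 / resistivity_ohm_m

# Illustrative sample values only (assumed for the demonstration).
for rho in (1.7e-8, 1e-3, 2.3e3, 1e8, 1e14):
    print(f"rho = {rho:.1e} ohm m, sigma = {conductivity(rho):.1e} S/m, "
          f"candidates = {candidate_classes(rho)}")
```

A value that returns two candidates, or none, is exactly the situation in which the other differences discussed later in the chapter are needed to decide the class.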

Our interest in this chapter is in the study of semiconductors which could 
be: 

(i) Elemental semiconductors: Si and Ge

(ii) Compound semiconductors: Examples are:

• Inorganic: CdS, GaAs, CdSe, InP, etc.

• Organic: anthracene, doped phthalocyanines, etc.

• Organic polymers: polypyrrole, polyaniline, polythiophene, etc. 

Most of the currently available semiconductor devices are based on 
elemental semiconductors Si or Ge and compound inorganic 
semiconductors. However, after 1990, a few semiconductor devices 
using organic semiconductors and semiconducting polymers have been 



developed signalling the birth of a futuristic technology of polymer- 
electronics and molecular-electronics. In this chapter, we will restrict 
ourselves to the study of inorganic semiconductors, particularly elemental 
semiconductors Si and Ge. The general concepts introduced here for 
discussing the elemental semiconductors, by-and-large, apply to most of 
the compound semiconductors as well. 

On the basis of energy bands 

According to the Bohr atomic model, in an isolated atom the energy of 
any of its electrons is decided by the orbit in which it revolves. But when 
the atoms come together to form a solid they are close to each other. So 
the outer orbits of electrons from neighbouring atoms would come very 
close or could even overlap. This would make the nature of electron 
motion in a solid very different from that in an isolated atom. 

Inside the crystal each electron has a unique position and no two 
electrons see exactly the same pattern of surrounding charges. Because 
of this, each electron will have a different energy level. These different 
energy levels with continuous energy variation form what are called 
energy bands. The energy band which includes the energy levels of the 
valence electrons is called the valence band. The energy band above the 
valence band is called the conduction band. With no external energy, all 
the valence electrons will reside in the valence band. If the lowest level in 
the conduction band happens to be lower than the highest level of the 
valence band, the electrons from the valence band can easily move into 
the conduction band. Normally the conduction band is empty. But when it 
overlaps on the valence band electrons can move freely into it. This is the 
case with metallic conductors. 

If there is some gap between the conduction band and the valence band, 
electrons in the valence band all remain bound and no free electrons are 



available in the conduction band. This makes the material an insulator. 
But some of the electrons from the valence band may gain external 
energy to cross the gap between the conduction band and the valence 
band. Then these electrons will move into the conduction band. At the 
same time they will create vacant energy levels in the valence band 
where other valence electrons can move. Thus the process creates the 
possibility of conduction due to electrons in conduction band as well as 
due to vacancies in the valence band. 

Let us consider what happens in the case of Si or Ge crystal containing N
atoms. For Si, the outermost orbit is the third orbit (n = 3), while for Ge it
is the fourth orbit (n = 4). The number of electrons in the outermost orbit
is 4 (two s and two p electrons). Hence, the total number of outer electrons in
the crystal is 4N. The maximum possible number of electrons in the outer
orbit is 8 (two s + six p electrons). So, for the 4N valence electrons there are
8N available energy states. These 8N discrete energy levels can either
form a continuous band or they may be grouped in different bands
depending upon the distance between the atoms in the crystal (see box
on Band Theory of Solids).

At the distance between the atoms in the crystal lattices of Si and Ge, the
energy band of these 8N states is split apart into two which are separated
by an energy gap $E_g$ (Fig. 14.1). The lower band, which is completely
occupied by the 4N valence electrons at temperature of absolute zero, is
the valence band. The other band consisting of 4N energy states, called
the conduction band, is completely empty at absolute zero.


Band theory of solids 

Consider that the Si or Ge crystal contains N atoms. Electrons of each atom will have
discrete energies in different orbits. The electron energy will be same if all the atoms are
isolated, i.e., separated from each other by a large distance. However, in a crystal, the atoms
are close to each other (2 to 3 Å) and therefore the electrons interact with each other and
also with the neighbouring atomic cores. The overlap (or interaction) will be more felt by the
electrons in the outermost orbit while the inner orbit or core electron energies may remain
unaffected. Therefore, for understanding electron energies in Si or Ge crystal, we need to
consider the changes in the energies of the electrons in the outermost orbit only. For Si, the
outermost orbit is the third orbit (n = 3), while for Ge it is the fourth orbit (n = 4). The number
of electrons in the outermost orbit is 4 (two s and two p electrons). Hence, the total number of outer
electrons in the crystal is 4N. The maximum possible number of outer electrons in the orbit is
8 (two s + six p electrons). So, out of the 4N electrons, 2N electrons are in the 2N s-states (orbital
quantum number l = 0) and 2N electrons are in the available 6N p-states. Obviously, some
p-electron states are empty as shown in the extreme right of Figure. This is the case of well
separated or isolated atoms [region A of Figure].
Suppose these atoms start coming nearer to each other to form a solid. The energies of
these electrons in the outermost orbit may change (both increase and decrease) due to the
interaction between the electrons of different atoms. The 6N states for l = 1, which originally
had identical energies in the isolated atoms, spread out and form an energy band [region B in
Figure]. Similarly, the 2N states for l = 0, having identical energies in the isolated atoms, split
into a second band (carefully see the region B of Figure) separated from the first one by an
energy gap.

At still smaller spacing, however, there comes a region in which the bands merge with each 
other. The lowest energy state that is a split from the upper atomic level appears to drop 
below the upper state that has come from the lower atomic level. In this region (region C in 
Figure), no energy gap exists where the upper and lower energy states get mixed. 

Finally, if the distance between the atoms further decreases, the energy bands again split
apart and are separated by an energy gap $E_g$ (region D in Figure). The total number of
available energy states 8N has been re-apportioned between the two bands (4N states each
in the lower and upper energy bands). Here the significant point is that there are exactly as
many states in the lower band (4N) as there are available valence electrons from the atoms
(4N).

Therefore, this band (called the valence band) is completely filled while the upper band is 
completely empty. The upper band is called the conduction band. 


The lowest energy level in the conduction band is shown as $E_c$ and
highest energy level in the valence band is shown as $E_v$. Above $E_c$ and
below $E_v$ there are a large number of closely spaced energy levels, as
shown in Fig. 14.1.
Figure 14.1 The energy band positions in a semiconductor at 0 K. The upper band, called the
conduction band, consists of an infinitely large number of closely spaced energy states. The lower
band, called the valence band, consists of closely spaced completely filled energy states.

The gap between the top of the valence band and bottom of the
conduction band is called the energy band gap (energy gap $E_g$). It may
be large, small, or zero, depending upon the material. These different
situations are depicted in Fig. 14.2 and discussed below:

Case I: This refers to a situation, as shown in Fig. 14.2(a). One can have
a metal either when the conduction band is partially filled and the
valence band is partially empty or when the conduction and valence
bands overlap. When there is overlap, electrons from the valence band can
easily move into the conduction band. This situation makes a large
number of electrons available for electrical conduction. When the valence
band is partially empty, electrons from its lower level can move to higher
level making conduction possible. Therefore, the resistance of such
materials is low or the conductivity is high.


Case II: In this case, as shown in Fig. 14.2(b), a large band gap $E_g$ exists
($E_g > 3$ eV). There are no electrons in the conduction band, and therefore
no electrical conduction is possible. Note that the energy gap is so large
that electrons cannot be excited from the valence band to the conduction
band by thermal excitation. This is the case of insulators.


[Figure 14.2 panel labels: (a) (i) conduction band above valence band, (ii) overlapping conduction band ($E_g \approx 0$) and valence band; (b) empty conduction band, $E_g > 3$ eV, valence band; (c).]

Figure 14.2 Difference between energy bands of (a) metals, 
(b) insulators and (c) semiconductors. 


Case III: This situation is shown in Fig. 14.2(c). Here a finite but small
band gap ($E_g < 3$ eV) exists. Because of the small band gap, at room
temperature some electrons from valence band can acquire enough 
energy to cross the energy gap and enter the conduction band. These 
electrons (though small in numbers) can move in the conduction band. 
Hence, the resistance of semiconductors is not as high as that of the 
insulators. 
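
How sharply the band gap controls thermal excitation can be illustrated numerically. The sketch below assumes the standard result, not derived in this section, that the number of thermally excited carriers scales roughly as $\exp(-E_g / 2kT)$. The gap values for Ge (0.72 eV) and Si (1.1 eV) are the ones quoted later in this chapter; the 3 eV entry is simply the boundary used above to separate semiconductors from insulators. The function name is hypothetical.

```python
import math

K_B_EV_PER_K = 8.617e-5  # Boltzmann constant in eV/K

def excitation_factor(band_gap_ev, temperature_k):
    """Relative weight exp(-Eg / (2 k T)) for thermal excitation across a gap.

    Assumption: carrier numbers scale with this factor; prefactors are ignored.
    """
    return math.exp(-band_gap_ev / (2 * K_B_EV_PER_K * temperature_k))

for name, gap in (("Ge", 0.72), ("Si", 1.1), ("3 eV boundary", 3.0)):
    factor = excitation_factor(gap, 300)  # 300 K taken as room temperature
    print(f"{name}: Eg = {gap} eV, factor at 300 K = {factor:.2e}")
```

Because the gap sits in an exponent, a change from about 1 eV to 3 eV reduces the factor by many orders of magnitude, which is consistent with the qualitative distinction drawn between Case II and Case III.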


In this section we have made a broad classification of metals, insulators
and semiconductors. In the section which follows you will learn the
conduction process in semiconductors.

14.3 Intrinsic Semiconductor 

We shall take the most common case of Ge and Si whose lattice 
structure is shown in Fig. 14.3. These structures are called the
diamond-like structures. Each atom is surrounded by four nearest neighbours. We
know that Si and Ge have four valence electrons. In its crystalline 
structure, every Si or Ge atom tends to share one of its four valence 
electrons with each of its four nearest neighbour atoms, and also to take 
share of one electron from each such neighbour. These shared electron 
pairs are referred to as forming a covalent bond or simply a valence 
bond. The two shared electrons can be assumed to shuttle back-and- 
forth between the associated atoms holding them together strongly. 

Figure 14.4 schematically shows the 2-dimensional representation of Si 
or Ge structure shown in Fig. 14.3 which overemphasises the covalent 
bond. It shows an idealised picture in which no bonds are broken (all 
bonds are intact). Such a situation arises at low temperatures. As the 
temperature increases, more thermal energy becomes available to these 
electrons and some of these electrons may break-away (becoming free 
electrons contributing to conduction). The thermal energy effectively 
ionises only a few atoms in the crystalline lattice and creates a vacancy 
in the bond as shown in Fig. 14.5(a). The neighbourhood from which the
free electron (with charge $-q$) has come out leaves a vacancy with an
effective charge ($+q$). This vacancy with the effective positive electronic
charge is called a hole. The hole behaves as an apparent free particle
with effective positive charge.



Figure 14.3 Three-dimensional diamond-like crystal structure for Carbon, Silicon or Germanium
with respective lattice spacing a equal to 3.56, 5.43 and 5.66 Å.


In intrinsic semiconductors, the number of free electrons, $n_e$, is equal to
the number of holes, $n_h$. That is

$n_e = n_h = n_i$ (14.1)

where $n_i$ is called intrinsic carrier concentration.

Semiconductors possess the unique property in which, apart from
electrons, the holes also move. Suppose there is a hole at site 1 as 
shown in Fig. 14.5(a). The movement of holes can be visualised as 
shown in Fig. 14.5(b). An electron from the covalent bond at site 2 may 
jump to the vacant site 1 (hole). Thus, after such a jump, the hole is at 
site 2 and the site 1 has now an electron. Therefore, apparently, the hole 
has moved from site 1 to site 2. Note that the electron originally set free 
[Fig. 14.5(a)] is not involved in this process of hole motion. The free
electron moves completely independently as conduction electron and
gives rise to an electron current, $I_e$, under an applied electric field.

Remember that the motion of hole is only a convenient way of describing
the actual motion of bound electrons, whenever there is an empty bond
anywhere in the crystal. Under the action of an electric field, these holes
move towards negative potential giving the hole current, $I_h$. The total
current, $I$, is thus the sum of the electron current $I_e$ and the hole current $I_h$:

$I = I_e + I_h$ (14.2)
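
The statement that hole motion is only a convenient description of the motion of bound electrons can be illustrated with a toy model. The sketch below treats a row of five bond sites as a Python list; this one-dimensional row, the site numbering and the function name `hop` are all assumptions made for illustration, with the positive side taken to be on the left.

```python
def hop(bonds, hole_index, electron_index):
    """Move a bound electron into the vacancy; the hole appears where it left."""
    if bonds[hole_index] != "hole" or bonds[electron_index] != "electron":
        raise ValueError("need a hole at hole_index and an electron at electron_index")
    new_bonds = list(bonds)
    new_bonds[hole_index], new_bonds[electron_index] = "electron", "hole"
    return new_bonds

# Five bond sites in a row; site 1 has lost its electron (a hole).
bonds = ["electron", "hole", "electron", "electron", "electron"]

# Bound electrons hop one site to the left, towards the positive side,
# so the hole drifts to the right, towards the negative side.
for site in range(1, 4):
    bonds = hop(bonds, hole_index=site, electron_index=site + 1)
    print("hole at site", bonds.index("hole"), bonds)
```

No electron ever travels more than one site, yet the vacancy moves across the whole row. Tracking the single hole is simpler than tracking every bound electron, which is why the hole is treated as a positive charge carrier.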
Figure 14.4 Schematic two-dimensional representation of Si or Ge structure showing covalent
bonds at low temperature (all bonds intact). +4 symbol indicates inner cores of Si or Ge. 



It may be noted that apart from the process of generation of conduction 
electrons and holes, a simultaneous process of recombination occurs in 
which the electrons recombine with the holes. At equilibrium, the rate of 
generation is equal to the rate of recombination of charge carriers. The 
recombination occurs due to an electron colliding with a hole. 






An intrinsic semiconductor will behave like an insulator at T = 0 K as 
shown in Fig. 14.6(a). It is the thermal energy at higher temperatures (T >
0 K), which excites some electrons from the valence band to the
conduction band. These thermally excited electrons at T > 0 K, partially 
occupy the conduction band. Therefore, the energy-band diagram of an 
intrinsic semiconductor will be as shown in Fig. 14.6(b). Here, some 
electrons are shown in the conduction band. These have come from the 
valence band leaving equal number of holes there. 
Figure 14.5 (a) Schematic model of generation of hole at site 1 and conduction electron due to
thermal energy at moderate temperatures, (b) Simplified representation of possible thermal
motion of a hole. The electron from the lower left hand covalent bond (site 2) goes to the earlier
hole site 1, leaving a hole at its site indicating an apparent movement of the hole from site 1 to
site 2.
Figure 14.6 (a) An intrinsic semiconductor at T = 0 K behaves like insulator, (b) At T > 0 K, four
thermally generated electron-hole pairs. The filled circles (●) represent electrons and empty
circles (○) represent holes.


Example 14.1 C, Si and Ge have same lattice structure. Why is C an
insulator while Si and Ge are intrinsic semiconductors?

Solution The 4 bonding electrons of C, Si or Ge lie, respectively, in
the second, third and fourth orbit. Hence, energy required to take out
an electron from these atoms (i.e., ionisation energy $E_g$) will be least
for Ge, followed by Si and highest for C. Hence, number of free
electrons for conduction in Ge and Si are significant but negligibly
small for C.


14.4 Extrinsic Semiconductor 

The conductivity of an intrinsic semiconductor depends on its 
temperature, but at room temperature its conductivity is very low. As 
such, no important electronic devices can be developed using these 
semiconductors. Hence there is a necessity of improving their 
conductivity. This can be done by making use of impurities. 



























When a small amount, say, a few parts per million (ppm), of a suitable 
impurity is added to the pure semiconductor, the conductivity of the 
semiconductor is increased manifold. Such materials are known as 
extrinsic semiconductors or impurity semiconductors. The deliberate 
addition of a desirable impurity is called doping and the impurity atoms 
are called dopants. Such a material is also called a doped 
semiconductor. The dopant has to be such that it does not distort the 
original pure semiconductor lattice. It occupies only a very few of the 
original semiconductor atom sites in the crystal. A necessary condition to 
attain this is that the sizes of the dopant and the semiconductor atoms 
should be nearly the same. 

There are two types of dopants used in doping the tetravalent Si or Ge: 

(i) Pentavalent (valency 5); like Arsenic (As), Antimony (Sb),
Phosphorus (P), etc.

(ii) Trivalent (valency 3); like Indium (In), Boron (B), Aluminium (Al), etc. 

We shall now discuss how the doping changes the number of charge 
carriers (and hence the conductivity) of semiconductors. Si or Ge belongs 
to the fourth group in the Periodic table and, therefore, we choose the 
dopant element from nearby fifth or third group, expecting and taking 
care that the size of the dopant atom is nearly the same as that of Si or 
Ge. Interestingly, the pentavalent and trivalent dopants in Si or Ge give 
two entirely different types of semiconductors as discussed below. 

(i) n-type semiconductor 

Suppose we dope Si or Ge with a pentavalent element as shown in Fig. 
14.7. When an atom of +5 valency element occupies the position of an 
atom in the crystal lattice of Si, four of its electrons bond with the four 
silicon neighbours while the fifth remains very weakly bound to its parent 
atom. This is because the four electrons participating in bonding are seen
as part of the effective core of the atom by the fifth electron. As a result
the ionisation energy required to set this electron free is very small and
even at room temperature it will be free to move in the lattice of the
semiconductor. For example, the energy required is ~0.01 eV for
germanium, and 0.05 eV for silicon, to separate this electron from its
atom. This is in contrast to the energy required to jump the forbidden 
band (about 0.72 eV for germanium and about 1.1 eV for silicon) at room 
temperature in the intrinsic semiconductor. Thus, the pentavalent dopant 
is donating one extra electron for conduction and hence is known as 
donor impurity. The number of electrons made available for conduction 
by dopant atoms depends strongly upon the doping level and is 
independent of any increase in ambient temperature. On the other hand, 
the number of free electrons (with an equal number of holes) generated 
by Si atoms, increases weakly with temperature. 
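
The contrast between the donor ionisation energy and the band gap is easier to appreciate when both are expressed in units of the thermal energy kT. The sketch below is a rough comparison only: it assumes 300 K as room temperature and uses the energies quoted in the paragraph above; the names used are illustrative.

```python
K_B_EV_PER_K = 8.617e-5  # Boltzmann constant in eV/K

def thermal_energy_ev(temperature_k):
    """Characteristic thermal energy kT in eV."""
    return K_B_EV_PER_K * temperature_k

# Energies quoted in the text, in eV.
ENERGIES_EV = {
    "Ge donor ionisation": 0.01,
    "Si donor ionisation": 0.05,
    "Ge band gap": 0.72,
    "Si band gap": 1.1,
}

kt = thermal_energy_ev(300)  # 300 K assumed as room temperature
print(f"kT at 300 K = {kt:.4f} eV")
for label, energy in ENERGIES_EV.items():
    print(f"{label}: {energy} eV = {energy / kt:.1f} kT")
```

The donor ionisation energies come out comparable to kT, while the band gaps are tens of kT. This is consistent with the statement that donor electrons are free at room temperature, whereas only a small number of electrons cross the forbidden band thermally.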
Figure 14.7 (a) Pentavalent donor atom (As, Sb, P, etc.) doped for tetravalent Si or Ge giving
n-type semiconductor, and (b) Commonly used schematic representation of n-type material which
shows only the fixed cores of the substituent donors with one additional effective positive charge
and its associated extra electron.


In a doped semiconductor the total number of conduction electrons $n_e$ is
due to the electrons contributed by donors and those generated
intrinsically, while the total number of holes $n_h$ is only due to the holes
from the intrinsic source. But the rate of recombination of holes would
increase due to the increase in the number of electrons. As a result, the
number of holes would get reduced further.

Thus, with proper level of doping the number of conduction electrons can 
be made much larger than the number of holes. Hence in an extrinsic 
semiconductor doped with pentavalent impurity, electrons become the 
majority carriers and holes the minority carriers. These semiconductors 
are, therefore, known as n-type semiconductors. For n-type
semiconductors, we have,

$$n_e \gg n_h \qquad (14.3)$$

(ii) p-type semiconductor 

This is obtained when Si or Ge is doped with a trivalent impurity like Al, 
B, In, etc. The dopant has one valence electron less than Si or Ge and, 
therefore, this atom can form covalent bonds with neighbouring three Si 
atoms but does not have any electron to offer to the fourth Si atom. So 
the bond between the fourth neighbour and the trivalent atom has a 
vacancy or hole as shown in Fig. 14.8. Since the neighbouring Si atom in 
the lattice wants an electron in place of a hole, an electron in the outer 
orbit of an atom in the neighbourhood may jump to fill this vacancy, 
leaving a vacancy or hole at its own site. Thus the hole is available for 
conduction. Note that the trivalent foreign atom becomes effectively
negatively charged when it shares a fourth electron with a neighbouring Si
atom. Therefore, the dopant atom of p-type material can be treated as a
core of one negative charge along with its associated hole as shown in
Fig. 14.8(b). It is obvious that one acceptor atom gives one hole. These
holes are in addition to the intrinsically generated holes while the source 
of conduction electrons is only intrinsic generation. Thus, for such a 
material, the holes are the majority carriers and electrons are minority
carriers. Therefore, extrinsic semiconductors doped with trivalent impurity
are called p-type semiconductors. For p-type semiconductors, the
recombination process will reduce the number ($n_i$) of intrinsically
generated electrons to $n_e$. We have, for p-type semiconductors,

$$n_h \gg n_e \qquad (14.4)$$

Figure 14.8 (a) Trivalent acceptor atom (In, Al, B etc.) doped in tetravalent Si or Ge lattice giving
p-type semiconductor, (b) Commonly used schematic representation of p-type material which
shows only the fixed core of the substituent acceptor with one effective additional negative charge
and its associated hole.

Note that the crystal maintains an overall charge neutrality as the charge 
of additional charge carriers is just equal and opposite to that of the 
ionised cores in the lattice. 

In extrinsic semiconductors, because of the abundance of majority 
current carriers, the minority carriers produced thermally have more 
chance of meeting majority carriers and thus getting destroyed. Hence, 
the dopant, by adding a large number of current carriers of one type, 
which become the majority carriers, indirectly helps to reduce the intrinsic 
concentration of minority carriers. 

The semiconductor’s energy band structure is affected by doping. In the
case of extrinsic semiconductors, additional energy states due to donor
impurities ($E_D$) and acceptor impurities ($E_A$) also exist. In the energy band
diagram of n-type Si semiconductor, the donor energy level $E_D$ is slightly
below the bottom $E_C$ of the conduction band and electrons from this level
move into the conduction band with very small supply of energy. At room
temperature, most of the donor atoms get ionised but very few (~$10^{-12}$)
atoms of Si get ionised. So the conduction band will have most electrons
coming from the donor impurities, as shown in Fig. 14.9(a). Similarly, for
p-type semiconductor, the acceptor energy level $E_A$ is slightly above the
top $E_V$ of the valence band as shown in Fig. 14.9(b). With very small
supply of energy an electron from the valence band can jump to the level
$E_A$ and ionise the acceptor negatively. (Alternately, we can also say that
with very small supply of energy the hole from level $E_A$ sinks down into
the valence band. Electrons rise up and holes fall down when they gain
external energy.) At room temperature, most of the acceptor atoms get
ionised leaving holes in the valence band. Thus at room temperature the
density of holes in the valence band is predominantly due to impurity in
the extrinsic semiconductor. The electron and hole concentration in a
semiconductor in thermal equilibrium is given by

$$n_e n_h = n_i^2 \qquad (14.5)$$


Though the above description is grossly approximate and hypothetical, it 
helps in understanding the difference between metals, insulators and 
semiconductors (extrinsic and intrinsic) in a simple manner. The 
difference in the resistivity of C, Si and Ge depends upon the energy gap 
between their conduction and valence bands. For C (diamond), Si and 
Ge, the energy gaps are 5.4 eV, 1.1 eV and 0.7 eV, respectively. Sn also 
is a group IV element but it is a metal because the energy gap in its case 
is 0 eV.

Figure 14.9 Energy bands of (a) n-type semiconductor at T > 0 K, (b) p-type
semiconductor at T > 0 K.

Example 14.2 Suppose a pure Si crystal has $5 \times 10^{28}$ atoms m$^{-3}$. It is doped by 1 ppm
concentration of pentavalent As. Calculate the number of electrons and holes. Given that
$n_i = 1.5 \times 10^{16}$ m$^{-3}$.

Solution Note that thermally generated electrons ($n_i \sim 10^{16}$ m$^{-3}$) are
negligibly small as compared to those produced by doping. A doping level of
1 ppm gives a donor density $N_D = 10^{-6} \times 5 \times 10^{28} = 5 \times 10^{22}$ m$^{-3}$.

Therefore, $n_e \approx N_D$.

Since $n_e n_h = n_i^2$, the number of holes is

$$n_h = \frac{2.25 \times 10^{32}}{5 \times 10^{22}} \approx 4.5 \times 10^{9}\ \text{m}^{-3}$$
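
The arithmetic of Example 14.2 can be checked with a short Python sketch. The function name and its arguments are illustrative choices, not part of the text, and the code assumes, as the solution does, that every donor atom is ionised and that $n_e \approx N_D$.

```python
# Carrier concentrations in an n-type semiconductor (numbers from Example 14.2).
# Assumption: all donors are ionised and donor electrons dominate (n_e ~ N_D).

def carrier_concentrations(atom_density, doping_fraction, n_i):
    """Return (n_e, n_h) in m^-3 for a donor-doped crystal."""
    n_d = atom_density * doping_fraction   # donor density N_D
    n_e = n_d                              # approximation: n_e ~ N_D >> n_i
    n_h = n_i ** 2 / n_e                   # mass-action law: n_e * n_h = n_i^2
    return n_e, n_h

n_e, n_h = carrier_concentrations(5e28, 1e-6, 1.5e16)
print(f"n_e = {n_e:.2e} m^-3, n_h = {n_h:.2e} m^-3")
```

Changing the doping fraction shows how strongly the minority hole density is suppressed as the donor density rises.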


14.5 p-n Junction 

A p-n junction is the basic building block of many semiconductor devices 
like diodes, transistors, etc. A clear understanding of the junction
behaviour is important to analyse the working of other semiconductor 
devices. We will now try to understand how a junction is formed and how 
the junction behaves under the influence of external applied voltage (also 
called bias). 

14.5.1 p-n junction formation 

Consider a thin p-type silicon (p-Si) semiconductor wafer. By adding a
precisely controlled small quantity of pentavalent impurity, part of the
p-Si wafer can be converted into n-Si. There are several processes by which
a semiconductor can be formed. The wafer now contains a p-region, an
n-region and a metallurgical junction between the p- and n-regions.

Two important processes occur during the formation of a p-n junction: 
diffusion and drift. We know that in an n-type semiconductor, the 
concentration of electrons (number of electrons per unit volume) is more 
compared to the concentration of holes. Similarly, in a p-type 
semiconductor, the concentration of holes is more than the concentration 
of electrons. During the formation of p-n junction, and due to the
concentration gradient across the p- and n-sides, holes diffuse from p-side
to n-side (p → n) and electrons diffuse from n-side to p-side (n → p). This
motion of charge carriers gives rise to diffusion current across the
junction.

When an electron diffuses from n → p, it leaves behind an ionised donor
on n-side. This ionised donor (positive charge) is immobile as it is bonded
to the surrounding atoms. As the electrons continue to diffuse from n → p,
a layer of positive charge (or positive space-charge region) on n-side
of the junction is developed.

Similarly, when a hole diffuses from p → n due to the concentration
gradient, it leaves behind an ionised acceptor (negative charge) which is 
immobile. As the holes continue to diffuse, a layer of negative charge (or 
negative space-charge region) on the p-side of the junction is developed. 
This space-charge region on either side of the junction together is known 
as depletion region as the electrons and holes taking part in the initial 
movement across the junction depleted the region of its free charges 
(Fig. 14.10). The thickness of depletion region is of the order of one-tenth 
of a micrometre. Due to the positive space-charge region on n-side of the 
junction and negative space charge region on p-side of the junction, an 
electric field directed from positive charge towards negative charge 
develops. Due to this field, an electron on p-side of the junction moves to 
n-side and a hole on n-side of the junction moves to p-side. The motion
of charge carriers due to the electric field is called drift. Thus a drift
current, which is opposite in direction to the diffusion current (Fig. 14.10)
starts.

Figure 14.10 p-n junction formation process.

Initially, diffusion current is large and drift current is small. As the diffusion 
process continues, the space-charge regions on either side of the 
junction extend, thus increasing the electric field strength and hence drift 
current. This process continues until the diffusion current equals the drift 
current. Thus a p-n junction is formed. In a p-n junction under equilibrium 
there is no net current. 

The loss of electrons from the n-region and the gain of electrons by the
p-region causes a difference of potential across the junction of the two
regions. The polarity of this potential is such as to oppose further flow of
carriers so that a condition of equilibrium exists. Figure 14.11 shows the
p-n junction at equilibrium and the potential across the junction. The
n-material has lost electrons, and the p-material has acquired electrons. The
n-material is thus positive relative to the p-material. Since this potential
tends to prevent the movement of electrons from the n-region into the
p-region, it is often called a barrier potential.

Figure 14.11 (a) Diode under equilibrium (V = 0), (b) Barrier potential under no bias.


Example 14.3 Can we take one slab of p-type semiconductor and physically join it to another 
n-type semiconductor to get p-n junction? 

Solution No! Any slab, howsoever flat, will have roughness much larger than the inter-atomic 
crystal spacing (~2 to 3 Å) and hence continuous contact at the atomic level will not be
possible. The junction will behave as a discontinuity for the flowing charge carriers. 


14.6 Semiconductor Diode 

A semiconductor diode [Fig. 14.12(a)] is basically a p-n junction with 
metallic contacts provided at the ends for the application of an external 
voltage. It is a two terminal device. A p-n junction diode is symbolically 
represented as shown in Fig. 14.12(b). 

The direction of the arrow indicates the conventional direction of current
(when the diode is under forward bias). The equilibrium barrier potential
can be altered by applying an external voltage V across the diode. The
situation of p-n junction diode under equilibrium (without bias) is shown in
Fig. 14.11(a) and (b).


14.6.1 p-n junction diode under forward bias 

When an external voltage V is applied across a semiconductor diode 
such that p-side is connected to the positive terminal of the battery and n- 
side to the negative terminal [Fig. 14.13(a)], it is said to be forward 
biased. 


The applied voltage mostly drops across the depletion region and the
voltage drop across the p-side and n-side of the junction is negligible.
(This is because the resistance of the depletion region, a region where
there are no free charge carriers, is very high compared to the resistance
of the n-side and p-side.) The direction of the applied voltage (V) is
opposite to the built-in potential $V_0$. As a result, the depletion layer
width decreases and the barrier height is reduced [Fig. 14.13(b)]. The
effective barrier height under forward bias is ($V_0 - V$).

Figure 14.12 (a) Semiconductor diode, (b) Symbol for p-n junction diode.

If the applied voltage is small, the barrier potential will be reduced only
slightly below the equilibrium value, and only a small number of carriers
in the material (those that happen to be in the uppermost energy levels)
will possess enough energy to cross the junction. So the current will be
small. If we increase the applied voltage significantly, the barrier height
will be reduced and more carriers will have the required
energy. Thus the current increases.

Due to the applied voltage, electrons from n-side cross the depletion
region and reach p-side (where they are minority carriers). Similarly, holes
from p-side cross the junction and reach the n-side (where they are
minority carriers). This process under forward bias is known as minority
carrier injection. At the junction boundary, on each side, the minority
carrier concentration increases significantly compared to the locations far
from the junction.

Figure 14.13 (a) p-n junction diode under forward bias, (b) Barrier potential
(1) without battery, (2) Low battery voltage, and (3) High voltage battery.

Figure 14.14 Forward bias minority carrier injection.


Due to this concentration gradient, the injected electrons on p-side 
diffuse from the junction edge of p-side to the other end of p-side. 
Likewise, the injected holes on n-side diffuse from the junction edge of n- 
side to the other end of n-side (Fig. 14.14). This motion of charged 
carriers on either side gives rise to current. The total diode forward
current is the sum of the hole diffusion current and the conventional
current due to electron diffusion. The magnitude of this current is usually
in mA.

14.6.2 p-n junction diode under reverse bias 

When an external voltage (V) is applied across the diode such that n-side
is positive and p-side is negative, it is said to be reverse biased
[Fig. 14.15(a)]. The applied voltage mostly drops across the depletion
region. The direction of applied voltage is same as the direction of barrier
potential. As a result, the barrier height increases and the depletion
region widens due to the change in the electric field. The effective barrier
height under reverse bias is ($V_0 + V$) [Fig. 14.15(b)]. This suppresses the
flow of electrons from n → p and holes from p → n. Thus, the diffusion
current decreases enormously compared to the diode under forward
bias.

The electric field direction of the junction is such that if electrons on p- 
side or holes on n-side in their random motion come close to the junction, 
they will be swept to its majority zone. This drift of carriers gives rise to 
current. The drift current is of the order of a few μA. This is quite low
because it is due to the motion of carriers from their minority side to their
majority side across the junction. The drift current is also there under
forward bias but it is negligible (μA) when compared with current due to
injected carriers which is usually in mA.

The diode reverse current is not very much dependent on the applied 
voltage. Even a small voltage is sufficient to sweep the minority carriers 
from one side of the junction to the other side of the junction. The current 
is not limited by the magnitude of the applied voltage but is limited due to 
the concentration of the minority carrier on either side of the junction. 

The current under reverse bias is essentially voltage independent up to a
critical reverse bias voltage, known as breakdown voltage ($V_{br}$). When
$V = V_{br}$, the diode reverse current increases sharply. Even a slight increase
in the bias voltage causes large change in the current. If the reverse
current is not limited by an external circuit below the rated value
(specified by the manufacturer) the p-n junction will get destroyed. Once
it exceeds the rated value, the diode gets destroyed due to overheating.
This can happen even for the diode under forward bias, if the forward
current exceeds the rated value.

The circuit arrangement for studying the V-I characteristics of a diode
(i.e., the variation of current as a function of applied voltage) is shown in
Fig. 14.16(a) and (b). The battery is connected to the diode through a
potentiometer (or rheostat) so that the applied voltage to the diode can
be changed. For different values of voltages, the value of the current is
noted. A graph between V and I is obtained as in Fig. 14.16(c). Note that
in forward bias measurement, we use a milliammeter since the expected
current is large (as explained in the earlier section) while a microammeter
is used in reverse bias to measure the current. You can see in Fig. 14.16(c)
that in forward bias, the current first increases very slowly, almost
negligibly, till the voltage across the diode crosses a certain value. After
this characteristic voltage, the diode current increases significantly
(exponentially), even for a very small increase in the diode bias voltage.
This voltage is called the threshold voltage or cut-in voltage (~0.2 V for
germanium diode and ~0.7 V for silicon diode).

Figure 14.15 (a) Diode under reverse bias,
(b) Barrier potential under reverse bias.


For the diode in reverse bias, the current is very small (~μA) and almost
remains constant with change in bias. It is called reverse saturation
current. However, for special cases, at very high reverse bias (breakdown
voltage), the current suddenly increases. This special action of the
diode is discussed later in Section 14.8. General purpose diodes are
not used beyond the reverse saturation current region.

Figure 14.16 Experimental circuit arrangement for studying V-I characteristics of a p-n junction
diode (a) in forward bias, (b) in reverse bias, (c) Typical V-I characteristics of a silicon diode.


The above discussion shows that the p-n junction diode primarily allows
the flow of current only in one direction (forward bias). The forward bias
resistance is low as compared to the reverse bias resistance. This
property is used for rectification of ac voltages as discussed in the next
section. For diodes, we define a quantity called dynamic resistance as
the ratio of small change in voltage ΔV to a small change in current ΔI:

$$r_d = \frac{\Delta V}{\Delta I} \qquad (14.6)$$

Example 14.4 The V-I characteristic of a silicon diode is shown in the Fig. 14.17. Calculate
the resistance of the diode at (a) $I_D = 15$ mA and (b) $V_D = -10$ V.

Solution Considering the diode characteristics as a straight line
between I = 10 mA to I = 20 mA passing through the origin, we can
calculate the resistance using Ohm’s law.

(a) From the curve, at I = 20 mA, V = 0.8 V; I = 10 mA, V = 0.7 V
$r_{fb} = \Delta V/\Delta I = 0.1\ \text{V}/10\ \text{mA} = 10\ \Omega$

(b) From the curve at V = −10 V, I = −1 μA,

Therefore,

$r_{rb} = 10\ \text{V}/1\ \mu\text{A} = 1.0 \times 10^{7}\ \Omega$
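
The two resistances in Example 14.4 can be reproduced with a few lines of Python. This is only a sketch: the helper name is an illustrative choice, and the numbers are the points read off the curve in the example.

```python
# Forward and reverse resistance of the diode in Example 14.4.

def dynamic_resistance(v1, i1, v2, i2):
    """Dynamic resistance r_d = (change in V) / (change in I), in ohms."""
    return (v2 - v1) / (i2 - i1)

# (a) Forward bias: two points on the curve around I = 15 mA.
r_fb = dynamic_resistance(0.7, 10e-3, 0.8, 20e-3)

# (b) Reverse bias: magnitude of V / I at V = -10 V, I = -1 microampere.
r_rb = 10.0 / 1e-6

print(f"r_fb = {r_fb:.1f} ohm, r_rb = {r_rb:.1e} ohm")
```

The six orders of magnitude between the two values are what make the diode useful as a one-way element.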













14.7 Application of Junction Diode as a Rectifier 

From the V-I characteristic of a junction diode we see that it allows
current to pass only when it is forward biased. So if an alternating voltage 
is applied across a diode the current flows only in that part of the cycle 
when the diode is forward biased. This property is used to rectify 
alternating voltages and the circuit used for this purpose is called a 
rectifier. 

If an alternating voltage is applied across a diode in series with a load, a
pulsating voltage will appear across the load only during the half cycles of
the ac input during which the diode is forward biased. Such a rectifier
circuit, as shown in Fig. 14.18, is called a half-wave rectifier. The
secondary of a transformer supplies the desired ac voltage across
terminals A and B. When the voltage at A is positive, the diode is forward
biased and it conducts. When A is negative, the diode is reverse-biased
and it does not conduct. The reverse saturation current of a diode is
negligible and can be considered equal to zero for practical purposes.
(The reverse breakdown voltage of the diode must be sufficiently higher
than the peak ac voltage at the secondary of the transformer to protect
the diode from reverse breakdown.)

Figure 14.18 (a) Half-wave rectifier circuit, (b) Input ac voltage and output voltage waveforms from
the rectifier circuit.


Therefore, in the positive half-cycle of ac there is a current through the
load resistor $R_L$ and we get an output voltage, as shown in Fig. 14.18(b),
whereas there is no current in the negative half-cycle. In the next positive
half-cycle, again we get the output voltage. Thus, the output voltage,
though still varying, is restricted to only one direction and is said to be
rectified. Since the rectified output of this circuit is only for half of the
input ac wave it is called a half-wave rectifier.
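
The half-wave action described above can be sketched numerically in Python. The model is an idealisation: the diode is treated as a perfect switch with no cut-in voltage and zero reverse current, and the 10 V peak and the sample count are arbitrary illustrative values.

```python
import math

def half_wave(v_in):
    """Ideal diode: passes the positive half-cycle, blocks the negative one."""
    return v_in if v_in > 0 else 0.0

def sample_sine(v_peak, n_samples):
    """One full cycle of a sine wave, sampled at n_samples equally spaced points."""
    return [v_peak * math.sin(2 * math.pi * k / n_samples)
            for k in range(n_samples)]

v_in = sample_sine(10.0, 1000)            # input ac, hypothetical 10 V peak
v_out = [half_wave(v) for v in v_in]      # voltage across the load
average = sum(v_out) / len(v_out)         # dc (average) value of the output

print(f"average output = {average:.2f} V")
```

The output is never negative, and its average over a cycle comes out close to the peak voltage divided by π, the standard result for a half-wave rectified sine.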

The circuit using two diodes, shown in Fig. 14.19(a), gives output rectified
voltage corresponding to both the positive as well as negative half of the
ac cycle. Hence, it is known as full-wave rectifier. Here the p-sides of the
two diodes are connected to the ends of the secondary of the
transformer. The n-sides of the diodes are connected together and the
output is taken between this common point of diodes and the midpoint of 
the secondary of the transformer. So for a full-wave rectifier the 
secondary of the transformer is provided with a centre tapping and so it is 
called centre-tap transformer. As can be seen from Fig. 14.19(c) the 
voltage rectified by each diode is only half the total secondary voltage. 
Each diode rectifies only for half the cycle, but the two do so for alternate 
cycles. Thus, the output between their common terminals and the centre- 
tap of the transformer becomes a full-wave rectifier output. (Note that 
there is another circuit of full wave rectifier which does not need a centre- 
tap transformer but needs four diodes.) Suppose the input voltage to A 
with respect to the centre tap at any instant is positive. It is clear that, at 
that instant, voltage at B being out of phase will be negative as shown in 
Fig. 14.19(b). So, diode $D_1$ gets forward biased and conducts (while $D_2$
being reverse biased is not conducting). Hence, during this positive half
cycle we get an output current (and an output voltage across the load
resistor $R_L$) as shown in Fig. 14.19(c). In the course of the ac cycle when
the voltage at A becomes negative with respect to centre tap, the voltage
at B would be positive. In this part of the cycle diode $D_1$ would not
conduct but diode $D_2$ would, giving an output current and output voltage
(across $R_L$) during the negative half cycle of the input ac. Thus, we get

output voltage during both the positive as well as the negative half of the 
cycle. Obviously, this is a more efficient circuit for getting rectified voltage 
or current than the half-wave rectifier.

Figure 14.19 (a) A Full-wave rectifier circuit; (b) Input wave forms given to the diode $D_1$ at A and
to the diode $D_2$ at B; (c) Output waveform across the load $R_L$ connected in the full-wave rectifier
circuit.
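
A minimal Python sketch of the centre-tap full-wave rectifier follows. It assumes ideal diodes (no cut-in voltage, no reverse current); the 5 V peak on each half of the secondary and the sample count are hypothetical values chosen only for illustration.

```python
import math

def full_wave_output(v_a):
    """Ideal centre-tap full-wave rectifier.

    v_a is the voltage at A relative to the centre tap; the voltage at B
    is equal and opposite. D1 conducts when A is positive, D2 when B is.
    """
    v_b = -v_a
    d1_part = v_a if v_a > 0 else 0.0     # contribution through D1
    d2_part = v_b if v_b > 0 else 0.0     # contribution through D2
    return d1_part + d2_part

N = 1000
v_a = [5.0 * math.sin(2 * math.pi * k / N) for k in range(N)]
v_out = [full_wave_output(v) for v in v_a]
average = sum(v_out) / N

print(f"average output = {average:.2f} V")
```

For the same peak voltage across the conducting diode, the average output is twice that of the half-wave circuit, which is the sense in which the full-wave rectifier is more efficient.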

The rectified voltage is in the form of pulses of the shape of half
sinusoids. Though it is unidirectional it does not have a steady value. To
get steady dc output from the pulsating voltage normally a capacitor is
connected across the output terminals (parallel to the load $R_L$). One can
also use an inductor in series with $R_L$ for the same purpose. Since these
additional circuits appear to filter out the ac ripple and give a pure dc
voltage, they are called filters.

Now we shall discuss the role of capacitor in filtering. When the voltage 
across the capacitor is rising, it gets charged. If there is no external load, 
it remains charged to the peak voltage of the rectified output. When there 
is a load, it gets discharged through the load and the voltage across it 
begins to fall. In the next half-cycle of rectified output it again gets 
charged to the peak value (Fig. 14.20). The rate of fall of the voltage
across the capacitor depends inversely upon the product of the capacitance
C and the effective resistance $R_L$ used in the circuit; this product is
called the time constant. To make the time constant large, the value of C
should be large. So
capacitor input filters use large capacitors. The output voltage obtained
by using capacitor input filter is nearer to the peak voltage of the rectified 
voltage. This type of filter is most widely used in power supplies. 
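
The effect of the time constant can be illustrated with a rough Python estimate. The sketch assumes the standard exponential discharge of a capacitor through a resistor, $V(t) = V_p e^{-t/R_L C}$, which the text does not state explicitly, and it assumes the capacitor discharges for a full 10 ms between peaks (a full-wave rectifier on a 50 Hz supply). All component values are hypothetical.

```python
import math

def voltage_after_discharge(v_peak, r_load, capacitance, t):
    """Capacitor voltage after discharging through r_load for time t.

    Assumes simple RC decay: V(t) = v_peak * exp(-t / (R * C)).
    """
    time_constant = r_load * capacitance      # time constant R*C, in seconds
    return v_peak * math.exp(-t / time_constant)

# Hypothetical values: 10 V peak, 1 kilo-ohm load, 10 ms between peaks.
V_PEAK, R_LOAD, GAP = 10.0, 1000.0, 0.010

v_small_c = voltage_after_discharge(V_PEAK, R_LOAD, 10e-6, GAP)     # RC = 10 ms
v_large_c = voltage_after_discharge(V_PEAK, R_LOAD, 1000e-6, GAP)   # RC = 1 s

print(f"C = 10 uF:   output falls to {v_small_c:.2f} V")
print(f"C = 1000 uF: output falls to {v_large_c:.2f} V")
```

With the larger capacitor the output stays within about one per cent of the peak, which is why capacitor input filters use large capacitors.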



Figure 14.20 (a) A full-wave rectifier with capacitor filter, (b) Input and output
voltage of rectifier in (a).

14.8 Special Purpose p-n Junction Diodes 

In this section, we shall discuss some devices which are basically junction
diodes but are developed for different applications. 


14.8.1 Zener diode

It is a special purpose semiconductor diode, named after its inventor C.
Zener. It is designed to operate under reverse bias in the breakdown
region and used as a voltage regulator. The symbol for Zener diode is
shown in Fig. 14.21(a).

Zener diode is fabricated by heavily doping both p- and n-sides of the
junction. Due to this, the depletion region formed is very thin ($< 10^{-6}$ m) and
the electric field of the junction is extremely high (~$5 \times 10^{6}$ V/m) even for a
small reverse bias voltage of about 5 V. The I-V characteristics of a Zener
diode are shown in Fig. 14.21(b). It is seen that when the applied reverse
bias voltage (V) reaches the breakdown voltage ($V_z$) of the Zener diode,
there is a large change in the current. Note that after the breakdown
voltage $V_z$, a large change in the current can be produced by almost
insignificant change in the reverse bias voltage. In other words, Zener
voltage remains constant, even though current through the Zener diode
varies over a wide range. This property of the Zener diode is used for
regulating supply voltages so that they are constant.



Figure 14.21 Zener diode, (a) symbol, (b) I-V characteristics.


Let us understand how reverse current suddenly increases at the
breakdown voltage. We know that reverse current is due to the flow of
electrons (minority carriers) from p → n and holes from n → p. As the
reverse bias voltage is increased, the electric field at the junction
becomes significant. When the reverse bias voltage $V = V_z$, then the
electric field strength is high enough to pull valence electrons from the
host atoms on the p-side which are accelerated to n-side. These
electrons account for high current observed at the breakdown. The
emission of electrons from the host atoms due to the high electric field is
known as internal field emission or field ionisation. The electric field
required for field ionisation is of the order of $10^{6}$ V/m.

Zener diode as a voltage regulator 

We know that when the ac input voltage of a rectifier fluctuates, its
rectified output also fluctuates. To get a constant dc voltage from the dc
unregulated output of a rectifier, we use a Zener diode. The circuit
diagram of a voltage regulator using a Zener diode is shown in Fig.
14.22.

The unregulated dc voltage (filtered output of a rectifier) is connected to
the Zener diode through a series resistance $R_s$ such that the Zener diode
is reverse biased. If the input voltage increases, the current through $R_s$
and Zener diode also increases. This increases the voltage drop across
$R_s$ without any change in the voltage across the Zener diode. This is
because in the breakdown region, Zener voltage remains constant even
though the current through the Zener diode changes. Similarly, if the
input voltage decreases, the current through $R_s$ and Zener diode also
decreases. The voltage drop across $R_s$ decreases without any change in
the voltage across the Zener diode. Thus any increase/decrease in the
input voltage results in an increase/decrease of the voltage drop across $R_s$
without any change in voltage across the Zener diode. Thus the Zener
diode acts as a voltage regulator. We have to select the Zener diode
according to the required output voltage and accordingly the series
resistance $R_s$.

Figure 14.22 Zener diode as DC voltage regulator


Example 14.5 In a Zener regulated power supply a Zener diode with $V_z = 6.0$ V is used for
regulation. The load current is to be 4.0 mA and the unregulated input is 10.0 V. What should
be the value of series resistor $R_s$?

Solution

The value of $R_s$ should be such that the current through the Zener diode is much larger than
the load current. This is to have good load regulation. Choose Zener current as five times the
load current, i.e., $I_z = 20$ mA. The total current through $R_s$ is, therefore, 24 mA. The voltage
drop across $R_s$ is 10.0 − 6.0 = 4.0 V. This gives $R_s = 4.0\ \text{V}/(24 \times 10^{-3}\ \text{A}) = 167\ \Omega$. The
nearest value of carbon resistor is 150 Ω. So, a series resistor of 150 Ω is appropriate. Note
that slight variation in the value of the resistor does not matter, what is important is that the
current $I_z$ should be sufficiently larger than $I_L$.
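
The design rule used in Example 14.5 can be written as a small Python helper. The function name and the `zener_factor` parameter are illustrative; the factor of five is simply the choice made in the example, not a fixed rule.

```python
# Series resistor for a Zener regulator (numbers from Example 14.5).

def series_resistor(v_in, v_z, i_load, zener_factor=5.0):
    """Return R_s in ohms, taking the Zener current as zener_factor * i_load."""
    i_z = zener_factor * i_load          # chosen Zener current
    i_total = i_z + i_load               # total current through R_s
    return (v_in - v_z) / i_total        # voltage drop across R_s / current

r_s = series_resistor(10.0, 6.0, 4.0e-3)
print(f"R_s = {r_s:.0f} ohm")

# With the nearest standard value of 150 ohm, the Zener current becomes:
i_z_actual = (10.0 - 6.0) / 150.0 - 4.0e-3
print(f"I_z with 150 ohm = {i_z_actual * 1e3:.1f} mA")
```

With 150 Ω the Zener current rises only slightly above the design value, which is why a small variation in the resistor does not matter.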


14.8.2 Optoelectronic junction devices 

We have seen so far, how a semiconductor diode behaves under applied
electrical inputs. In this section, we learn about semiconductor diodes in
which carriers are generated by photons (photo-excitation). All these
devices are called optoelectronic devices. We shall study the functioning
of the following optoelectronic devices:

(i) Photodiodes used for detecting optical signal (photodetectors). 

(ii) Light emitting diodes (LED) which convert electrical energy into light. 

(iii) Photovoltaic devices which convert optical radiation into electricity 
(solar cells). 

(i) Photodiode 

A photodiode is again a special purpose p-n junction diode fabricated
with a transparent window to allow light to fall on the diode. It is operated
under reverse bias. When the photodiode is illuminated with light
(photons) with energy ($h\nu$) greater than the energy gap ($E_g$) of the
semiconductor, then electron-hole pairs are generated due to the
absorption of photons. The diode is fabricated such that the generation of 
e-h pairs takes place in or near the depletion region of the diode. Due to 
electric field of the junction, electrons and holes are separated before 
they recombine. The direction of the electric field is such that electrons 
reach n-side and holes reach p-side. Electrons are collected on n-side 
and holes are collected on p-side giving rise to an emf. When an external 
load is connected, current flows. The magnitude of the photocurrent 
depends on the intensity of incident light (photocurrent is proportional to 
incident light intensity). 
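
The condition $h\nu > E_g$ can be checked for a given wavelength with the short Python sketch below. The constants are the standard SI values; the function names and the two test wavelengths are illustrative, and the band gap of 1.1 eV for silicon is the figure quoted earlier in the chapter.

```python
PLANCK = 6.62607015e-34          # Planck constant, J s
LIGHT_SPEED = 2.99792458e8       # speed of light, m/s
ELECTRON_VOLT = 1.602176634e-19  # joules per electronvolt

def photon_energy_ev(wavelength_m):
    """Photon energy h*nu = h*c/lambda, expressed in eV."""
    return PLANCK * LIGHT_SPEED / wavelength_m / ELECTRON_VOLT

def creates_pairs(wavelength_m, band_gap_ev):
    """True if a photon of this wavelength can generate an electron-hole pair."""
    return photon_energy_ev(wavelength_m) > band_gap_ev

E_G_SILICON = 1.1   # eV
for wavelength in (600e-9, 1500e-9):     # hypothetical test wavelengths
    energy = photon_energy_ev(wavelength)
    print(f"{wavelength * 1e9:.0f} nm: {energy:.2f} eV, "
          f"detected = {creates_pairs(wavelength, E_G_SILICON)}")
```

Light of 600 nm carries about 2.07 eV per photon and is absorbed, while 1500 nm light (about 0.83 eV) falls below the silicon band gap and produces no photocurrent.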

It is easier to observe the change in the current with change in the light 
intensity, if a reverse bias is applied. Thus photodiode can be used as a 
photodetector to detect optical signals. The circuit diagram used for the 
measurement of I-V characteristics of a photodiode is shown in Fig.
14.23(a) and typical I-V characteristics in Fig. 14.23(b).

Figure 14.23 (a) An illuminated photodiode under reverse bias, (b) I-V characteristics of a
photodiode for different illumination intensity $I_4 > I_3 > I_2 > I_1$.


Example 14.6 The current in the forward bias is known to be more (~mA) than the current in 
the reverse bias (~|jA). What is the reason then to operate the photodiodes in reverse bias? 

Solution Consider the case of an n-type semiconductor. Obviously, the majority carrier 
density (n) is considerably larger than the minority hole density p (i.e., n » p). On 
illumination, let the excess electrons and holes generated be An and Ap, respectively: 

n' = n + An 

p' = p + Ap 




























Here n' and p' are the electron and hole concentrations at any particular illumination and n 
and p are the carrier concentrations when there is no illumination. Remember Δn = Δp and n >> 
p. Hence, the fractional change in the majority carriers (i.e., Δn/n) would be much less than 
that in the minority carriers (i.e., Δp/p). In general, we can state that the fractional change due 
to the photo-effects on the minority carrier dominated reverse bias current is more easily 
measurable than the fractional change in the forward bias current. Hence, photodiodes are 
preferably used in the reverse bias condition for measuring light intensity. 
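
The argument of Example 14.6 can be made concrete with numbers. The short Python sketch below compares Δn/n with Δp/p for one set of carrier densities. All three densities are assumed, illustrative values chosen only so that n >> p; they are not taken from the text.

```python
# Illustrative (assumed) carrier densities for an n-type sample, in m^-3.
n = 1.0e22      # majority electrons without illumination (assumed)
p = 2.25e10     # minority holes without illumination (assumed)
delta = 1.0e12  # excess electron-hole pairs created by light, so Δn = Δp (assumed)

def fractional_change(dark_density, excess):
    """Return excess / dark_density, the fractional change on illumination."""
    return excess / dark_density

majority_change = fractional_change(n, delta)  # Δn / n
minority_change = fractional_change(p, delta)  # Δp / p

print(f"majority carriers: {majority_change:.1e}")
print(f"minority carriers: {minority_change:.1e}")
```

With these assumed values the majority carrier density changes by a negligible fraction while the minority carrier density changes many times over, which is why the minority-carrier-dominated reverse current responds so visibly to light.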


(ii) Light emitting diode 

It is a heavily doped p-n junction which under forward bias emits 
spontaneous radiation. The diode is encapsulated with a transparent 
cover so that emitted light can come out. 

When the diode is forward biased, electrons are sent from n → p (where 
they are minority carriers) and holes are sent from p → n (where they are 
minority carriers). At the junction boundary the concentration of minority 
carriers increases compared to the equilibrium concentration (i.e., when 
there is no bias). Thus at the junction boundary on either side of the 
junction, excess minority carriers are there which recombine with majority 
carriers near the junction. On recombination, the energy is released in 
the form of photons. Photons with energy equal to or slightly less than the 
band gap are emitted. When the forward current of the diode is small, the 
intensity of light emitted is small. As the forward current increases, 
intensity of light increases and reaches a maximum. Further increase in 
the forward current results in decrease of light intensity. LEDs are biased 
such that the light emitting efficiency is maximum. 

The V-I characteristics of an LED are similar to those of a Si junction diode. 
But the threshold voltages are much higher and slightly different for each 
colour. The reverse breakdown voltages of LEDs are very low, typically 
around 5 V. So care should be taken that high reverse voltages do not 
appear across them. 

LEDs that can emit red, yellow, orange, green and blue light are 
commercially available. The semiconductor used for fabrication of visible 
LEDs must at least have a band gap of 1.8 eV (spectral range of visible 
light is from about 0.4 μm to 0.7 μm, i.e., from about 3 eV to 1.8 eV). The 
compound semiconductor Gallium Arsenide-Phosphide (GaAs_{1-x}P_x) is 
used for making LEDs of different colours. GaAs_{0.6}P_{0.4} (Eg ~ 1.9 eV) is 
used for red LED. GaAs (Eg ~ 1.4 eV) is used for making infrared LED. 

These LEDs find extensive use in remote controls, burglar alarm 
systems, optical communication, etc. Extensive research is being done 
for developing white LEDs which can replace incandescent lamps. 
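
The conversion between the visible wavelength range and the photon energies quoted above follows from E = hc/λ. As a rough check, the Python sketch below uses the rounded value hc ≈ 1239.84 eV nm; the function name `photon_energy_ev` is only an illustrative choice.

```python
# hc expressed in eV·nm (rounded standard value).
HC_EV_NM = 1239.84

def photon_energy_ev(wavelength_um):
    """Photon energy in eV for a wavelength given in micrometres."""
    wavelength_nm = wavelength_um * 1000.0
    return HC_EV_NM / wavelength_nm

# The two ends of the visible range quoted in the text.
for wavelength_um in (0.4, 0.7):
    print(f"{wavelength_um} um -> {photon_energy_ev(wavelength_um):.2f} eV")
```

The two results, about 3.10 eV and 1.77 eV, agree with the rounded range of 3 eV to 1.8 eV given in the text.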

LEDs have the following advantages over conventional incandescent low 
power lamps: 

(i) Low operational voltage and less power. 

(ii) Fast action and no warm-up time required. 

(iii) The bandwidth of emitted light is 100 Å to 500 Å or in other words it is 
nearly (but not exactly) monochromatic. 

(iv) Long life and ruggedness. 

(v) Fast on-off switching capability. 

(iii) Solar cell 

A solar cell is basically a p-n junction which generates emf when solar 
radiation falls on the p-n junction. It works on the same principle 
(photovoltaic effect) as the photodiode, except that no external bias is 
applied and the junction area is kept much larger for solar radiation to be 
incident because we are interested in more power. 

A simple p-n junction solar cell is shown in Fig. 14.24. 

A p-Si wafer of about 300 μm is taken over which a thin layer (~0.3 μm) 
of n-Si is grown on one side by diffusion process. The other side of p-Si 
is coated with a metal (back contact). On the top of n-Si layer, metal 
finger electrode (or metallic grid) is deposited. This acts as a front 
contact. The metallic grid occupies only a very small fraction of the cell 
area (<15%) so that light can be incident on the cell from the top. 

The generation of emf by a solar cell, when light falls on it, is due to the 
following three basic processes: generation, separation and collection: 

(i) generation of e-h pairs due to light (with hν > Eg) close to the junction; 

(ii) separation of electrons and holes due to electric field of the depletion 
region. Electrons are swept to n-side and holes to p-side; 

(iii) the electrons reaching the n-side are collected by the front contact 
and holes reaching p-side are collected by the back contact. Thus p-side 
becomes positive and n-side becomes negative giving rise to 
photovoltage. 






















Figure 14.24 (a) Typical p-n junction solar cell; (b) Cross-sectional view. 


When an external load is connected as shown in the Fig. 14.25(a) a 
photocurrent I_L flows through the load. A typical I-V characteristics of a 
solar cell is shown in the Fig. 14.25(b). 

Note that the I - V characteristics of solar cell is drawn in the fourth 
quadrant of the coordinate axes. This is because a solar cell does not 
draw current but supplies the same to the load. 

Semiconductors with band gap close to 1.5 eV are ideal materials for 
solar cell fabrication. Solar cells are made with semiconductors like Si (Eg 
= 1.1 eV), GaAs (Eg = 1.43 eV), CdTe (Eg = 1.45 eV), CuInSe2 (Eg = 1.04 
eV), etc. The important criteria for the selection of a material for solar cell 
fabrication are (i) band gap (~1.0 to 1.8 eV), (ii) high optical absorption 
(~10^4 cm^-1), (iii) electrical conductivity, (iv) availability of the raw material, 
and (v) cost. Note that sunlight is not always required for a solar cell. Any 
light with photon energies greater than the bandgap will do. Solar cells 
are used to power electronic devices in satellites and space vehicles and 
also as power supply to some calculators. Production of low-cost 
photovoltaic cells for large-scale solar energy is a topic for research. 
Figure 14.25 (a) A typical illuminated p-n junction solar cell; 
(b) I-V characteristics of a solar cell. 


Example 14.7 Why are Si and GaAs preferred materials for solar cells? 

Solution The solar radiation spectrum received by us is shown in Fig. 14.26. 

Figure 14.26 

The maximum is near 1.5 eV. For photo-excitation, hν > Eg. Hence, a semiconductor with band 
gap ~1.5 eV or lower is likely to give better solar conversion efficiency. Silicon has Eg ~ 1.1 
eV while for GaAs it is ~1.53 eV. In fact, GaAs is better (in spite of its higher band gap) than 
Si because of its relatively higher absorption coefficient. If we choose materials like CdS or 
CdSe (Eg ~ 2.4 eV), we can use only the high energy component of the solar energy for 
photo-conversion and a significant part of energy will be of no use. 

The question arises: why do we not use a material like PbS (Eg ~ 
0.4 eV), which satisfies the condition hν > Eg for the ν maxima 
corresponding to the solar radiation spectrum? If we do so, most of the 
solar radiation will be absorbed on the top layer of the solar cell and will 
not reach in or near the depletion region. For effective electron-hole 
separation, due to the junction field, we want the photo-generation to 
occur in the junction region only. 
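
The condition hν > Eg used in Example 14.7 can be restated as a cut-off wavelength λ = hc/Eg, beyond which a material cannot absorb. The Python sketch below is only an illustration: it uses the rounded value hc ≈ 1239.84 eV nm, band gaps quoted in this section (the GaAs value is the 1.43 eV from the list of solar cell materials), and function names that are hypothetical.

```python
HC_EV_NM = 1239.84  # hc in eV·nm (rounded standard value)

# Band gaps in eV as quoted in this section.
BAND_GAPS_EV = {"PbS": 0.4, "Si": 1.1, "GaAs": 1.43, "CdS": 2.4}

def cutoff_wavelength_nm(band_gap_ev):
    """Longest wavelength (nm) whose photons still satisfy h*nu > Eg."""
    return HC_EV_NM / band_gap_ev

def can_absorb(photon_energy_ev, band_gap_ev):
    """True when a photon of this energy can create an electron-hole pair."""
    return photon_energy_ev > band_gap_ev

PEAK_EV = 1.5  # approximate maximum of the solar spectrum, per the example
for name, eg in BAND_GAPS_EV.items():
    print(f"{name}: cut-off {cutoff_wavelength_nm(eg):.0f} nm, "
          f"absorbs peak photons: {can_absorb(PEAK_EV, eg)}")
```

The check shows only whether absorption is possible. It says nothing about where in the cell the light is absorbed, which is the reason given above for rejecting PbS even though it satisfies the condition.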










14.9 Digital Electronics and Logic Gates 

In electronic circuits like amplifiers and oscillators, introduced to you in 
earlier sections, the signal (current or voltage) has been in the form of 
continuous, time-varying voltage or current. Such signals are called 
continuous or analog signals. A typical analog signal is shown in Fig. 
14.27(a). Fig. 14.27(b) shows a pulse waveform in which only discrete 
values of voltages are possible. It is convenient to use binary numbers to 
represent such signals. A binary number has only two digits, ‘0’ (say, 0 V) 
and ‘1’ (say, 5 V). In digital electronics we use only these two levels of 
voltage as shown in Fig. 14.27(b). Such signals are called Digital Signals. 
In digital circuits only two values (represented by 0 or 1) of the input and 
output voltage are permissible. 

This section is intended to provide the first step in our understanding of 
digital electronics. We shall restrict our study to some basic building 
blocks of digital electronics (called Logic Gates) which process the digital 
signals in a specific manner. Logic gates are used in calculators, digital 
watches, computers, robots, industrial control systems, and in 
telecommunications. 

A light switch in your house can be used as an example of a digital 
circuit. The light is either ON or OFF depending on the switch position. 
When the light is ON, the output value is ‘1’. When the light is OFF the 
output value is ‘0’. The input is the position of the light switch. The 
switch is placed either in the ON or OFF position to activate the light. 

14.9.1 Logic gates 

A gate is a digital circuit that follows a certain logical relationship between 
the input and output voltages. Therefore, such circuits are generally known as 
logic gates; they are called gates because they control the flow of information. The five 
common logic gates used are NOT, AND, OR, NAND, NOR. Each logic 
gate is indicated by a symbol and its function is defined by a truth table 
that shows all the possible input logic level combinations with their 
respective output logic levels. Truth tables help understand the behaviour 
of logic gates. These logic gates can be realised using semiconductor 
devices. 

(i) NOT gate 

This is the most basic gate, with one input and one output. It produces a 
‘1’ output if the input is ‘0’ and vice-versa. That is, it produces an inverted 
version of the input at its output. This is why it is also known as an 
inverter. The commonly used symbol together with the truth table for this 
gate is given in Fig. 14.28. 

(ii) OR Gate 

An OR gate has two or more inputs with one output. The logic symbol 
and truth table are shown in Fig. 14.29. The output Y is 1 when either 
input A or input B or both are 1s, that is, if any of the inputs is high, the 
output is high. 




Figure 14.27 (a) Analog signal, (b) Digital signal. 
















Apart from carrying out the above mathematical logic operation, this gate 
can be used for modifying the pulse waveform as explained in the 
following example. 



Figure 14.28 (a) Logic symbol, (b) Truth table of NOT gate. 

Figure 14.29 (a) Logic symbol, (b) Truth table of OR gate. 























Example 14.8 Justify the output waveform (Y) of the OR gate for the following inputs A and B 
given in Fig. 14.30. 

Solution Note the following: 

• At t < t1; A = 0, B = 0; Hence Y = 0 

• For t1 to t2; A = 1, B = 0; Hence Y = 1 

• For t2 to t3; A = 1, B = 1; Hence Y = 1 

• For t3 to t4; A = 0, B = 1; Hence Y = 1 

• For t4 to t5; A = 0, B = 0; Hence Y = 0 

• For t5 to t6; A = 1, B = 0; Hence Y = 1 

• For t > t6; A = 0, B = 1; Hence Y = 1 

Therefore the waveform Y will be as shown in the Fig. 14.30. 
Figure 14.30 
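
The interval-by-interval reasoning of Example 14.8 can also be checked with a short Python sketch. The input values for each interval are those listed in the solution; the helper name `or_gate` and the list layout are merely illustrative choices.

```python
def or_gate(a, b):
    """Two-input OR: the output is 1 if either input (or both) is 1."""
    return 1 if (a == 1 or b == 1) else 0

# (interval label, A, B) as listed in the solution of Example 14.8.
intervals = [
    ("t < t1",   0, 0),
    ("t1 to t2", 1, 0),
    ("t2 to t3", 1, 1),
    ("t3 to t4", 0, 1),
    ("t4 to t5", 0, 0),
    ("t5 to t6", 1, 0),
    ("t > t6",   0, 1),
]

# Evaluate the gate once per interval to build the output waveform Y.
y_waveform = [or_gate(a, b) for _, a, b in intervals]

for (label, a, b), y in zip(intervals, y_waveform):
    print(f"{label}: A = {a}, B = {b} -> Y = {y}")
```

Replacing `or_gate` with a function for another gate gives the corresponding output waveform for the same inputs.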



























(iii) AND Gate 

An AND gate has two or more inputs and one output. The output Y of 
AND gate is 1 only when input A and input B are both 1. The logic symbol 
and truth table for this gate are given in Fig. 14.31. 

Figure 14.31 (a) Logic symbol, (b) Truth table of AND gate. 

Example 14.9 Take A and B input waveforms similar to that in Example 14.8. Sketch the 
output waveform obtained from AND gate. 

Solution 

• For t < t1; A = 0, B = 0; Hence Y = 0 

• For t1 to t2; A = 1, B = 0; Hence Y = 0 

• For t2 to t3; A = 1, B = 1; Hence Y = 1 

• For t3 to t4; A = 0, B = 1; Hence Y = 0 

• For t4 to t5; A = 0, B = 0; Hence Y = 0 

• For t5 to t6; A = 1, B = 0; Hence Y = 0 

• For t > t6; A = 0, B = 1; Hence Y = 0 

Based on the above, the output waveform for AND gate can be drawn as given in Fig. 14.32. 

Figure 14.32 


(iv) NAND Gate 

This is an AND gate followed by a NOT gate. If inputs A and B are both 
‘1’, the output Y is not ‘1’. The gate gets its name from this NOT AND 
behaviour. Figure 14.33 shows the symbol and truth table of NAND gate. 


NAND gates are also called Universal Gates since by using these gates 
you can realise other basic gates like OR, AND and NOT (Exercises 
14.12 and 14.13). 
Figure 14.33 (a) Logic symbol, (b) Truth table of NAND gate. 
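
The statement that NAND gates are universal can be verified by exhaustion over all input combinations. The Python sketch below uses one standard way of wiring NOT, AND and OR from NAND gates alone; the circuits in the exercises may be drawn differently, and the function names are illustrative only.

```python
from itertools import product

def nand(a, b):
    """Two-input NAND: the output is 0 only when both inputs are 1."""
    return 0 if (a == 1 and b == 1) else 1

def not_from_nand(a):
    """NOT obtained by tying both NAND inputs together."""
    return nand(a, a)

def and_from_nand(a, b):
    """AND obtained as a NAND followed by a NAND-based NOT."""
    return not_from_nand(nand(a, b))

def or_from_nand(a, b):
    """OR obtained by inverting each input before a final NAND."""
    return nand(not_from_nand(a), not_from_nand(b))

# Compare each construction with the expected truth table.
for a, b in product((0, 1), repeat=2):
    assert and_from_nand(a, b) == (a & b)
    assert or_from_nand(a, b) == (a | b)
for a in (0, 1):
    assert not_from_nand(a) == 1 - a

print("NOT, AND and OR all reproduced using NAND gates only")
```

Because there are only four input combinations for a two-input gate, checking every row of the truth table is a complete proof for these constructions.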


Example 14.10 Sketch the output Y from a NAND gate having inputs A and B given below: 

Solution 
























• For t < t1; A = 1, B = 1; Hence Y = 0 

• For t1 to t2; A = 0, B = 0; Hence Y = 1 

• For t2 to t3; A = 0, B = 1; Hence Y = 1 

• For t3 to t4; A = 1, B = 0; Hence Y = 1 

• For t4 to t5; A = 1, B = 1; Hence Y = 0 

• For t5 to t6; A = 0, B = 0; Hence Y = 1 

• For t > t6; A = 0, B = 1; Hence Y = 1 

Figure 14.34 


(v) NOR Gate 

It has two or more inputs and one output. A NOT operation applied after 
OR gate gives a NOT-OR gate (or simply NOR gate). Its output Y is ‘1’ 
only when both inputs A and B are ‘0’, i.e., neither one input nor the other 
is ‘1’. The symbol and truth table for NOR gate are given in Fig. 14.35. 

Figure 14.35 (a) Logic symbol, (b) Truth table of NOR gate. 


NOR gates are considered as universal gates because you can obtain all 
the gates like AND, OR, NOT by using only NOR gates (Exercises 14.14 
and 14.15). 


FASTER AND SMALLER: THE FUTURE OF COMPUTER TECHNOLOGY 

The Integrated Chip (IC) is at the heart of all computer systems. In fact ICs are found in 
almost all electrical devices like cars, televisions, CD players, cell phones etc. The 
miniaturisation that made the modern personal computer possible could never have 
happened without the IC. ICs are electronic devices that contain many transistors, resistors, 
capacitors and connecting wires, all in one package. You must have heard of the 
microprocessor. The microprocessor is an IC that processes all information in a computer, 
like keeping track of what keys are pressed, running programmes, games etc. The IC was 
first invented by Jack Kilby at Texas Instruments in 1958 and he was awarded the Nobel Prize for 
this in 2000. ICs are produced on a piece of semiconductor crystal (or chip) by a process 
called photolithography. Thus, the entire Information Technology (IT) industry hinges on 
semiconductors. Over the years, the complexity of ICs has increased while the size of their 
features continued to shrink. In the past five decades, a dramatic miniaturisation in computer 
technology has made modern day computers faster and smaller. In the 1970s, Gordon 
Moore, co-founder of INTEL, pointed out that the memory capacity of a chip (IC) 
approximately doubled every one and a half years. This is popularly known as Moore’s law. 
The number of transistors per chip has risen exponentially and each year computers are 
becoming more powerful, yet cheaper than the year before. It is estimated from current trends 
that the computers available in 2020 will operate at 40 GHz (40,000 MHz) and would be 
much smaller, more efficient and less expensive than present day computers. The explosive 
growth in the semiconductor industry and computer technology is best expressed by a 
famous quote from Gordon Moore: “If the auto industry advanced as rapidly as the 
semiconductor industry, a Rolls Royce would get half a million miles per gallon, and it would 
be cheaper to throw it away than to park it”. 


Summary 

1. Semiconductors are the basic materials used in the present solid state electronic devices 
like diode, transistor, ICs, etc. 

2. Lattice structure and the atomic structure of constituent elements decide whether a 
particular material will be insulator, metal or semiconductor. 

3. Metals have low resistivity (10^-2 to 10^-8 Ω m), insulators have very high resistivity (>10^8 
Ω m), while semiconductors have intermediate values of resistivity. 

4. Semiconductors are elemental (Si, Ge) as well as compound (GaAs, CdS, etc.). 

5. Pure semiconductors are called ‘intrinsic semiconductors’. The presence of charge carriers 
(electrons and holes) is an ‘intrinsic’ property of the material and these are obtained as a 
result of thermal excitation. The number of electrons (n_e) is equal to the number of holes (n_h) 
in intrinsic semiconductors. Holes are essentially electron vacancies with an effective positive 
charge. 

6. The number of charge carriers can be changed by ‘doping’ of a suitable impurity in pure 
semiconductors. Such semiconductors are known as extrinsic semiconductors. These are of 
two types (n-type and p-type). 


7. In n-type semiconductors, n_e >> n_h, while in p-type semiconductors n_h >> n_e. 


8. n-type semiconducting Si or Ge is obtained by doping with pentavalent atoms (donors) like 
As, Sb, P, etc., while p-type Si or Ge can be obtained by doping with trivalent atom 
(acceptors) like B, Al, In etc. 

9. n_e n_h = n_i^2 in all cases. Further, the material possesses an overall charge neutrality. 

10. There are two distinct bands of energies (called valence band and conduction band) in 
which the electrons in a material lie. Valence band energies are low as compared to 
conduction band energies. All energy levels in the valence band are filled while energy levels 
in the conduction band may be fully empty or partially filled. The electrons in the conduction 
band are free to move in a solid and are responsible for the conductivity. The extent of 
conductivity depends upon the energy gap (Eg) between the top of valence band (E_V) and 
the bottom of the conduction band (E_C). The electrons from valence band can be excited by 
heat, light or electrical energy to the conduction band and thus, produce a change in the 
current flowing in a semiconductor. 

11. For insulators Eg > 3 eV, for semiconductors Eg is 0.2 eV to 3 eV, while for metals Eg ≈ 0. 

12. p-n junction is the ‘key’ to all semiconductor devices. When such a junction is made, a 
‘depletion layer’ is formed consisting of immobile ion-cores devoid of their electrons or holes. 
This is responsible for a junction potential barrier. 

13. By changing the external applied voltage, junction barriers can be changed. In forward 
bias (n-side is connected to negative terminal of the battery and p-side is connected to the 
positive), the barrier is decreased while the barrier increases in reverse bias. Hence, forward 
bias current is more (mA) while reverse bias current is very small (μA) in a p-n junction diode. 

14. Diodes can be used for rectifying an ac voltage (restricting the ac voltage to one 
direction). With the help of a capacitor or a suitable filter, a dc voltage can be obtained. 

15. There are some special purpose diodes. 

16. Zener diode is one such special purpose diode. In reverse bias, after a certain voltage 
(the breakdown voltage), the current suddenly increases in a Zener diode. This property has been 
used to obtain voltage regulation. 

17. p-n junctions have also been used to obtain many photonic or 
optoelectronic devices where one of the participating entities is the ‘photon’: (a) Photodiodes in 
which photon excitation results in a change of reverse saturation current which helps us to 
measure light intensity; (b) Solar cells which convert photon energy into electricity; (c) Light 
Emitting Diode and Diode Laser in which electron excitation by a bias voltage results in the 
generation of light. 

18. There are some special circuits which handle the digital data consisting of 0 and 1 levels. 
This forms the subject of Digital Electronics. 

19. The important digital circuits performing special logic operations are called logic gates. 
These are: OR, AND, NOT, NAND, and NOR gates. 


Points to Ponder 

1. The energy bands (E_C or E_V) in the semiconductors are space delocalised which means 
that these are not located in any specific place inside the solid. The energies are the overall 
averages. When you see a picture in which E_C or E_V are drawn as straight lines, then they 
should be respectively taken simply as the bottom of conduction band energy levels and top 
of valence band energy levels. 

2. In elemental semiconductors (Si or Ge), the n-type or p-type semiconductors are obtained 
by introducing ‘dopants’ as defects. In compound semiconductors, the change in relative 
stoichiometric ratio can also change the type of semiconductor. For example, in ideal GaAs 
the ratio of Ga:As is 1:1 but in Ga-rich or As-rich GaAs it could respectively be Ga_{1.1}As_{0.9} 
or Ga_{0.9}As_{1.1}. In general, the presence of defects controls the properties of semiconductors 
in many ways. 




3. In modern day circuits, many logical gates or circuits are integrated in one single ‘Chip’. 
These are known as Integrated Circuits (IC). 



Exercises 

14.1 In an n-type silicon, which of the following statements is true: 

(a) Electrons are majority carriers and trivalent atoms are the 
dopants. 

(b) Electrons are minority carriers and pentavalent atoms are the 
dopants. 

(c) Holes are minority carriers and pentavalent atoms are the 
dopants. 

(d) Holes are majority carriers and trivalent atoms are the dopants. 

14.2 Which of the statements given in Exercise 14.1 is true for p-type 
semiconductors? 

14.3 Carbon, silicon and germanium have four valence electrons 
each. These are characterised by valence and conduction bands 
separated by energy band gap respectively equal to (Eg)_C, (Eg)_Si and 
(Eg)_Ge. Which of the following statements is true? 

(a) (Eg)_Si < (Eg)_Ge < (Eg)_C 

(b) (Eg)_C < (Eg)_Ge > (Eg)_Si 

(c) (Eg)_C > (Eg)_Si > (Eg)_Ge 

(d) (Eg)_C = (Eg)_Si = (Eg)_Ge 


14.4 In an unbiased p-n junction, holes diffuse from the p-region to 
n-region because 

(a) free electrons in the n-region attract them. 

(b) they move across the junction by the potential difference. 

(c) hole concentration in p-region is more as compared to n-region. 

(d) All the above. 

14.5 When a forward bias is applied to a p-n junction, it 

(a) raises the potential barrier. 

(b) reduces the majority carrier current to zero. 

(c) lowers the potential barrier. 

(d) None of the above. 

14.6 In half-wave rectification, what is the output frequency if the 
input frequency is 50 Hz? What is the output frequency of a full-wave 
rectifier for the same input frequency? 

14.7 A p-n photodiode is fabricated from a semiconductor with band 
gap of 2.8 eV. Can it detect a wavelength of 6000 nm? 

Additional Exercises 

14.8 The number of silicon atoms per m^3 is 5 × 10^28. This is doped 
simultaneously with 5 × 10^22 atoms per m^3 of Arsenic and 5 × 10^20 
per m^3 atoms of Indium. Calculate the number of electrons and holes. 
Given that n_i = 1.5 × 10^16 m^-3. Is the material n-type or p-type? 




14.9 In an intrinsic semiconductor the energy gap Eg is 1.2 eV. Its hole 
mobility is much smaller than electron mobility and independent of 
temperature. What is the ratio between conductivity at 600 K and that 
at 300 K? Assume that the temperature dependence of intrinsic 
carrier concentration n_i is given by 

n_i = n_0 exp(-Eg / (2 k_B T)) 

where n_0 is a constant. 


14.10 In a p-n junction diode, the current I can be expressed as 


where I_0 is called the reverse saturation current, V is the voltage 
across the diode and is positive for forward bias and negative for 
reverse bias, and I is the current through the diode, k_B is the 
Boltzmann constant (8.6 × 10^-5 eV/K) and T is the absolute 
temperature. If for a given diode I_0 = 5 × 10^-12 A and T = 300 K, then 


(a) What will be the forward current at a forward voltage of 0.6 V? 

(b) What will be the increase in the current if the voltage across the 
diode is increased to 0.7 V? 

(c) What is the dynamic resistance? 

(d) What will be the current if reverse bias voltage changes from 1 V 
to 2 V? 

14.11 You are given the two circuits as shown in Fig. 14.36. Show 
that circuit (a) acts as OR gate while the circuit (b) acts as AND gate. 











(a) (b) 


Figure 14.36 

14.12 Write the truth table for a NAND gate connected as given in 
Fig. 14.37. 



Hence identify the exact logic operation carried out by this circuit. 


14.13 You are given two circuits as shown in Fig. 14.38, which 
consist of NAND gates. Identify the logic operation carried out by the 
two circuits. 



(a) 


(b) 


Figure 14.38 


14.14 Write the truth table for circuit given in Fig. 14.39 below 
consisting of NOR gates and identify the logic operation (OR, AND, 
NOT) which this circuit is performing. 

Figure 14.39 


(Hint: A = 0, B = 1 then A and B inputs of second NOR gate will be 0 
and hence Y=1. Similarly work out the values of Y for other 
combinations of A and B. Compare with the truth table of OR, AND, 
NOT gates and find the correct one.) 


14.15 Write the truth table for the circuits given in Fig. 14.40 
consisting of NOR gates only. Identify the logic operations (OR, AND, 
NOT) performed by the two circuits. 






Figure 14.40 
Chapter Fifteen 


Communication Systems 


15.1 Introduction 

Communication is the act of transmission of information. Every living 
creature in the world experiences the need to impart or receive 
information almost continuously with others in the surrounding world. For 
communication to be successful, it is essential that the sender and the 
receiver understand a common language. Man has constantly made 
endeavors to improve the quality of communication with other human 
beings. Languages and methods used in communication have kept 
evolving from prehistoric to modern times, to meet the growing demands 
in terms of speed and complexity of information. It would be worthwhile to 
look at the major milestones in events that promoted developments in 
communications, as presented in Table 15.1. 

Modern communication has its roots in the 19th and 20th century in the 
work of scientists like J.C. Bose, F.B. Morse, G. Marconi and Alexander 
Graham Bell. The pace of development seems to have increased 
dramatically after the first half of the 20th century. We can hope to see 
many more accomplishments in the coming decades. The aim of this 
chapter is to introduce the concepts of communication, namely the mode 
of communication, the need for modulation, production and detection of 
amplitude modulation. 

15.2 Elements of a Communication System 

Communication pervades all stages of life of all living creatures. 
Irrespective of its nature, every communication system has three 
essential elements: transmitter, medium/channel and receiver. The block 
diagram shown in Fig. 15.1 depicts the general form of a communication 
system. 



Table 15.1 Some major milestones in the history of communication 

Year | Event | Remarks 
---- | ----- | ------- 
Around 1565 A.D. | The reporting of the delivery of a child by queen using drum beats from a distant place to King Akbar. | It is believed that minister Birbal experimented with the arrangement to decide the number of drummers posted between the place where the queen stayed and the place where the king stayed. 
1835 | Invention of telegraph by Samuel F.B. Morse and Sir Charles Wheatstone | It resulted in tremendous growth of messages through post offices and reduced physical travel of messengers considerably. 
1876 | Telephone invented by Alexander Graham Bell and Antonio Meucci | Perhaps the most widely used means of communication in the history of mankind. 
1895 | Jagadis Chandra Bose and Guglielmo Marconi demonstrated wireless telegraphy. | It meant a giant leap, from an era of communication using wires to communicating without using wires (wireless). 
1936 | Television broadcast (John Logie Baird) | First television broadcast by BBC 
1955 | First radio FAX transmitted across continent (Alexander Bain) | The idea of FAX transmission was patented by Alexander Bain in 1843. 
1968 | ARPANET, the first Internet, came into existence (J.C.R. Licklider) | ARPANET was a project undertaken by the U.S. defence department. It allowed file transfer from one computer to another connected to the network. 
1975 | Fiber optics developed at Bell Laboratories | Fiber optical systems are superior and more economical compared to traditional communication systems. 
1989-91 | Tim Berners-Lee invented the World Wide Web. | WWW may be regarded as the mammoth encyclopedia of knowledge accessible to everyone round the clock throughout the year. 










Figure 15.1 Block diagram of a generalised communication system. 

In a communication system, the transmitter is located at one place, the 
receiver is located at some other place (far or near) separate from the 
transmitter and the channel is the physical medium that connects them. 
Depending upon the type of communication system, a channel may be in 
the form of wires or cables connecting the transmitter and the receiver or 
it may be wireless. The purpose of the transmitter is to convert the 
message signal produced by the source of information into a form 
suitable for transmission through the channel. If the output of the 
information source is a non-electrical signal like a voice signal, a 
transducer converts it to electrical form before giving it as an input to the 
transmitter. When a transmitted signal propagates along the channel it 
may get distorted due to channel imperfection. Moreover, noise adds to 
the transmitted signal and the receiver receives a corrupted version of 
the transmitted signal. The receiver has the task of operating on the 
received signal. It reconstructs a recognisable form of the original 
message signal for delivering it to the user of information. 

There are two basic modes of communication: point-to-point and 
broadcast. 

In point-to-point communication mode, communication takes place over a 
link between a single transmitter and a receiver. Telephony is an example 
of such a mode of communication. In contrast, in the broadcast mode, 
there are a large number of receivers corresponding to a single 
transmitter. Radio and television are examples of broadcast mode of 
communication. 

15.3 Basic Terminology Used in Electronic 
Communication Systems 

By now, we have become familiar with some terms like information 
source, transmitter, receiver, channel, noise, etc. It would be easy to 
understand the principles underlying any communication, if we get 
ourselves acquainted with the following basic terminology. 

(i) Transducer: Any device that converts one form of energy into another 
can be termed as a transducer. In electronic communication systems, we 
usually come across devices that have either their inputs or outputs in the 
electrical form. An electrical transducer may be defined as a device that 
converts some physical variable (pressure, displacement, force, 
temperature, etc.) into corresponding variations in the electrical signal at 
its output. 


Jagadis Chandra Bose (1858 - 1937) 



He developed an apparatus for generating ultrashort 
electromagnetic waves and studied their quasi-optical properties. He was said to be the first 
to employ a semiconductor like galena as a self-recovering detector of electromagnetic 
waves. Bose published three papers in the British magazine, ‘The Electrician’ of 27 Dec. 
1895. His invention was published in the ‘Proceedings of The Royal Society’ on 27 April 1899, 
over two years before Marconi’s first wireless communication on 13 December 1901. Bose 
also invented highly sensitive instruments for the detection of minute responses by living 
organisms to external stimuli and established parallelism between animal and plant tissues. 


(ii) Signal: Information converted in electrical form and suitable for 
transmission is called a signal. Signals can be either analog or digital. 
Analog signals are continuous variations of voltage or current. They are 
essentially single-valued functions of time. Sine wave is a fundamental 
analog signal. All other analog signals can be fully understood in terms of 
their sine wave components. Sound and picture signals in TV are analog 
in nature. Digital signals are those which can take only discrete stepwise 
values. Binary system that is extensively used in digital electronics 
employs just two levels of a signal. ‘0’ corresponds to a low level and ‘1’
corresponds to a high level of voltage/current. There are several coding
schemes useful for digital communication. They employ suitable
combinations of number systems such as the binary coded decimal
(BCD)*. American Standard Code for Information Interchange (ASCII)**
is a universally popular digital code to represent numbers, letters and
certain characters.

(Nowadays, optical signals are also in use.) 

(iii) Noise: Noise refers to the unwanted signals that tend to disturb the
transmission and processing of message signals in a communication 
system. The source generating the noise may be located inside or 
outside the system. 

(iv) Transmitter: A transmitter processes the incoming message signal so 
as to make it suitable for transmission through a channel and subsequent 
reception. 

(v) Receiver: A receiver extracts the desired message signals from the 
received signals at the channel output. 

(vi) Attenuation: The loss of strength of a signal while propagating 
through a medium is known as attenuation. 

* In BCD, a digit is usually represented by four binary (0 or 1) bits. For 
example the numbers 0, 1, 2, 3, 4 in the decimal system are written as 
0000, 0001, 0010, 0011 and 0100. 1000 would represent eight. 

** It is a character encoding in terms of numbers based on English 
alphabet since the computer can only understand numbers. 

(vii) Amplification: It is the process of increasing the amplitude (and
consequently the strength) of a signal using an electronic circuit called
the amplifier (see Chapter 14). Amplification is necessary to
compensate for the attenuation of the signal in communication systems.
The energy needed for additional signal strength is obtained from a DC
power source. Amplification is done at a place between the source and
the destination wherever signal strength becomes weaker than the
required strength.

(viii) Range: It is the largest distance between a source and a destination 
up to which the signal is received with sufficient strength. 

(ix) Bandwidth: Bandwidth refers to the frequency range over which an 
equipment operates or the portion of the spectrum occupied by the 
signal. 

(x) Modulation: The original low frequency message/information signal 
cannot be transmitted to long distances because of reasons given in 
Section 15.7. Therefore, at the transmitter, information contained in the 
low frequency message signal is superimposed on a high frequency 
wave, which acts as a carrier of the information. This process is known 
as modulation. As will be explained later, there are several types of 
modulation, abbreviated as AM, FM and PM. 

(xi) Demodulation: The process of retrieval of information from the carrier 
wave at the receiver is termed demodulation. This is the reverse process 
of modulation. 

(xii) Repeater: A repeater is a combination of a receiver and a transmitter. 
A repeater picks up the signal from the transmitter, amplifies it and
retransmits it to the receiver, sometimes with a change in carrier
frequency. Repeaters are used to extend the range of a communication 
system as shown in Fig. 15.2. A communication satellite is essentially a 
repeater station in space. 




Figure 15.2 Use of repeater station to increase the range of communication.
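
The BCD and ASCII codes mentioned under (ii) Signal can be illustrated with a short sketch. The helper names `to_bcd` and `to_ascii_codes` are hypothetical and chosen only for this illustration; the sketch assumes non-negative integers and plain English characters.

```python
def to_bcd(number):
    """Encode a non-negative integer in BCD: four binary bits per decimal digit."""
    return " ".join(format(int(digit), "04b") for digit in str(number))


def to_ascii_codes(text):
    """Return the ASCII code (an integer) of each character in the text."""
    return [ord(ch) for ch in text]


# The footnote's examples: 4 -> 0100 and 8 -> 1000.
print(to_bcd(4), to_bcd(8))
# A multi-digit number is coded digit by digit.
print(to_bcd(2024))
# Each letter is represented by a number.
print(to_ascii_codes("AM"))
```

Note that BCD codes each decimal digit separately, so the BCD form of a multi-digit number generally differs from its pure binary form.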


15.4 Bandwidth of Signals 

In a communication system, the message signal can be voice, music, 
picture or computer data. Each of these signals has different ranges of 
frequencies. The type of communication system needed for a given 
signal depends on the band of frequencies which is considered essential 
for the communication process. 

For speech signals, frequency range 300 Hz to 3100 Hz is considered 
adequate. Therefore speech signal requires a bandwidth of 2800 Hz 
(3100 Hz - 300 Hz) for commercial telephonic communication. To 
transmit music, an approximate bandwidth of 20 kHz is required because 
of the high frequencies produced by the musical instruments. The audible 
range of frequencies extends from 20 Hz to 20 kHz. 

Video signals for transmission of pictures require about 4.2 MHz of 
bandwidth. A TV signal contains both voice and picture and is usually 
allocated 6 MHz of bandwidth for transmission. 




Figure 15.3 Approximation of a rectangular wave in terms of a fundamental sine wave and its
harmonics. The panels show the rectangular wave, the fundamental ($\nu_0$), and sums of the
fundamental with the second harmonic ($2\nu_0$) and the third harmonic ($3\nu_0$).

In the preceding paragraph, we have considered only analog signals.
Digital signals are in the form of rectangular waves as shown in Fig. 15.3.
One can show that this rectangular wave can be decomposed into a
superposition of sinusoidal waves of frequencies $\nu_0$, $2\nu_0$, $3\nu_0$,
$4\nu_0$, ..., $n\nu_0$, where n is an integer extending to infinity and
$\nu_0 = 1/T_0$. The fundamental ($\nu_0$), fundamental ($\nu_0$) + second
harmonic ($2\nu_0$), and fundamental ($\nu_0$) + second harmonic ($2\nu_0$) +
third harmonic ($3\nu_0$) are shown in the same figure to illustrate this
fact. It is clear that to reproduce the rectangular wave shape exactly we
need to superimpose all the harmonics $\nu_0$, $2\nu_0$, $3\nu_0$, $4\nu_0$, ...,
which implies an infinite bandwidth. However, for practical purposes, the
contribution from higher harmonics can be neglected, thus limiting the
bandwidth. As a result, the received wave is a distorted version of the
transmitted one. If the bandwidth is large enough to accommodate a few
harmonics, the information is not lost and the rectangular signal is more
or less recovered. This is so because the higher the harmonic, the less
is its contribution to the wave form.
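
The effect of keeping only a few harmonics can be sketched numerically. The example below is an illustration under stated assumptions, not taken from the text: it assumes a rectangular pulse train of unit height, period 1 and duty cycle 0.25 (a hypothetical choice), for which the standard Fourier cosine coefficients are a_n = (2/nπ) sin(nπ × duty). It compares the ideal wave with the sum of its first few harmonics.

```python
import math


def rect_wave(t, period=1.0, duty=0.25):
    """Ideal rectangular pulse train: 1 inside the pulse (centred on t = 0), else 0."""
    phase = (t / period + 0.5) % 1.0 - 0.5  # position within one period, in [-0.5, 0.5)
    return 1.0 if abs(phase) < duty / 2 else 0.0


def partial_sum(t, n_harmonics, period=1.0, duty=0.25):
    """Mean value plus the first n_harmonics sinusoidal components of the pulse train."""
    total = duty  # dc (average) level of the wave
    for n in range(1, n_harmonics + 1):
        a_n = 2.0 / (n * math.pi) * math.sin(n * math.pi * duty)
        total += a_n * math.cos(2 * math.pi * n * t / period)
    return total


def rms_error(n_harmonics, samples=2000):
    """Root-mean-square difference between the ideal wave and its truncated sum."""
    err2 = 0.0
    for k in range(samples):
        t = (k + 0.5) / samples - 0.5  # sample one full period
        err2 += (rect_wave(t) - partial_sum(t, n_harmonics)) ** 2
    return math.sqrt(err2 / samples)


for n in (1, 3, 10):
    print(n, "harmonics: rms error =", round(rms_error(n), 3))
```

The error falls as more harmonics are included, but it never reaches zero for a finite number of harmonics, which is the sense in which an exact rectangular wave needs infinite bandwidth.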


15.5 Bandwidth of Transmission Medium 

Similar to message signals, different types of transmission media offer 
different bandwidths. The commonly used transmission media are wire, 
free space and fiber optic cable. Coaxial cable is a widely used wire 
medium, which offers a bandwidth of approximately 750 MHz. Such 
cables are normally operated below 18 GHz. Communication through 
free space using radio waves takes place over a very wide range of 
frequencies: from a few hundreds of kHz to a few GHz. This range of 
frequencies is further subdivided and allocated for various services as 
indicated in Table 15.2. Optical communication using fibers is performed 
in the frequency range of 1 THz to 1000 THz (microwaves to ultraviolet). 
An optical fiber can offer a transmission bandwidth in excess of 100 GHz. 

Spectrum allocations are arrived at by an international agreement. The 
International Telecommunication Union (ITU) administers the present 
system of frequency allocations. 


Table 15.2 Some important wireless communication frequency bands

Service                    Frequency bands     Comments
Standard AM broadcast      540-1600 kHz
FM broadcast               88-108 MHz
Television                 54-72 MHz           VHF (very high frequencies)
                           76-88 MHz           TV
                           174-216 MHz         UHF (ultra high frequencies)
                           420-890 MHz         TV
Cellular Mobile Radio      896-901 MHz         Mobile to base station
                           840-935 MHz         Base station to mobile
Satellite Communication    5.925-6.425 GHz     Uplink
                           3.7-4.2 GHz         Downlink


15.6 Propagation of Electromagnetic Waves 

In communication using radio waves, an antenna at the transmitter
radiates electromagnetic waves (em waves), which travel through
space and reach the receiving antenna at the other end. As the em wave
travels away from the transmitter, the strength of the wave keeps on 
decreasing. Several factors influence the propagation of em waves and 
the path they follow. At this point, it is also important to understand the 
composition of the earth’s atmosphere as it plays a vital role in the 
propagation of em waves. A brief discussion on some useful layers of the 
atmosphere is given in Table 15.3. 

15.6.1 GROUND WAVE 

To radiate signals with high efficiency, the antennas should have a size
comparable to the wavelength $\lambda$ of the signal (at least ~ $\lambda/4$). At longer
wavelengths (i.e., at lower frequencies), the antennas have large physical
size and they are located on or very near to the ground. In standard AM
broadcast, ground based vertical towers are generally used as
transmitting antennas. For such antennas, ground has a strong influence
on the propagation of the signal. The mode of propagation is called
surface wave propagation and the wave glides over the surface of the
earth. A wave induces current in the ground over which it passes and it is
attenuated as a result of absorption of energy by the earth. The
attenuation of surface waves increases very rapidly with increase in
frequency. The maximum range of coverage depends on the transmitted
power and frequency (less than a few MHz).


Table 15.3 Different layers of atmosphere and their interaction with the
propagating electromagnetic waves

Name of the stratum (layer)   Approximate height           Exists during          Frequencies most affected
                              over earth’s surface

Troposphere                   10 km                        Day and night          VHF (up to several GHz)

Parts of ionosphere:

D (part of stratosphere)      65-75 km                     Day only               Reflects LF, absorbs MF
                                                                                  and HF to some degree

E (part of stratosphere)      100 km                       Day only               Helps surface waves,
                                                                                  reflects HF

F1 (part of mesosphere)       170-190 km                   Daytime, merges with   Partially absorbs HF waves
                                                           F2 at night            yet allowing them to reach F2

F2 (thermosphere)             300 km at night,             Day and night          Efficiently reflects HF waves,
                              250-400 km during daytime                           particularly at night

15.6.2 SKY WAVES 

In the frequency range from a few MHz up to 30 to 40 MHz, long distance
communication can be achieved by ionospheric reflection of radio waves
back towards the earth. This mode of propagation is called sky wave
propagation and is used by short wave broadcast services. The
ionosphere is so called because of the presence of a large number of
ions or charged particles. It extends from a height of ~ 65 km to about
400 km above the earth’s surface. Ionisation occurs due to the
absorption of the ultraviolet and other high-energy radiation coming from
the sun by air molecules. The ionosphere is further subdivided into
several layers, the details of which are given in Table 15.3. The degree of
ionisation varies with the height. The density of atmosphere decreases
with height. At great heights the solar radiation is intense but there are
few molecules to be ionised. Close to the earth, even though the
molecular concentration is very high, the radiation intensity is low so that
the ionisation is again low. However, at some intermediate heights, there
occurs a peak of ionisation density. The ionospheric layer acts as a
reflector for a certain range of frequencies (3 to 30 MHz).
Electromagnetic waves of frequencies higher than 30 MHz penetrate the
ionosphere and escape. These phenomena are shown in Fig. 15.4.
The phenomenon of bending of em waves so that they are diverted
towards the earth is similar to total internal reflection in optics*.



Figure 15.4 Sky wave propagation. The layer nomenclature is given in Table 15.3.



15.6.3 SPACE WAVE 

Another mode of radio wave propagation is by space waves. A space 
wave travels in a straight line from transmitting antenna to the receiving 
antenna. Space waves are used for line-of-sight (LOS) communication as 
well as satellite communication. At frequencies above 40 MHz, 
communication is essentially limited to line-of-sight paths. At these 
frequencies, the antennas are relatively smaller and can be placed at 
heights of many wavelengths above the ground. Because of line-of-sight 
nature of propagation, direct waves get blocked at some point by the 
curvature of the earth as illustrated in Fig. 15.5. If the signal is to be 
received beyond the horizon then the receiving antenna must be high 
enough to intercept the line-of-sight waves. 








Figure 15.5 Line of sight communication by space waves. 

If the transmitting antenna is at a height $h_T$, then you can show that the
distance to the horizon $d_T$ is given as $d_T = \sqrt{2Rh_T}$, where R is the
radius of the earth (approximately 6400 km). $d_T$ is also called the radio
horizon of the transmitting antenna. With reference to Fig. 15.5, the
maximum line-of-sight distance $d_M$ between the two antennas having
heights $h_T$ and $h_R$ above the earth is given by

$$d_M = \sqrt{2Rh_T} + \sqrt{2Rh_R} \qquad (15.1)$$

where $h_R$ is the height of the receiving antenna.
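
The expression for the radio horizon can be obtained from simple geometry. The sketch below assumes a perfectly spherical earth and a straight line-of-sight ray that just touches the surface at the horizon, so that the earth's radius R, the distance d_T and the line from the centre of the earth to the antenna top form a right-angled triangle.

```latex
% Right-angled triangle: sides R and d_T, hypotenuse R + h_T
(R + h_T)^2 = R^2 + d_T^2
\;\Rightarrow\; d_T^2 = 2Rh_T + h_T^2
% Since h_T \ll R, the term h_T^2 is negligible compared with 2Rh_T
\;\Rightarrow\; d_T \approx \sqrt{2Rh_T}
```

Applying the same argument to the receiving antenna and adding the two horizon distances gives the maximum line-of-sight distance of Eq. (15.1).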


Figure 15.6 Various propagation modes for em waves.


Television broadcast, microwave links and satellite communication are 
some examples of communication systems that use space wave mode of 
propagation. Figure 15.6 summarises the various modes of wave 
propagation discussed so far. 


Example 15.1 A transmitting antenna at the top of a tower has a height 32 m and the height
of the receiving antenna is 50 m. What is the maximum distance between them for
satisfactory communication in LOS mode? Given radius of earth $6.4 \times 10^6$ m.

Solution

$$d_M = \sqrt{2 \times 64 \times 10^5 \times 32} + \sqrt{2 \times 64 \times 10^5 \times 50} \ \text{m}$$

$$= 64 \times 10^2 \times \sqrt{10} + 8 \times 10^3 \times \sqrt{10} \ \text{m}$$

$$= 144 \times 10^2 \times \sqrt{10} \ \text{m} = 45.5 \ \text{km}$$
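
The arithmetic of Example 15.1 can be checked with a few lines of code. This is a minimal sketch of Eq. (15.1); the function names `radio_horizon` and `max_los_distance` are hypothetical and used only for illustration.

```python
import math

R_EARTH = 6.4e6  # radius of the earth in metres, as given in Example 15.1


def radio_horizon(height_m, radius_m=R_EARTH):
    """Distance to the horizon, sqrt(2 R h), for an antenna at the given height."""
    return math.sqrt(2.0 * radius_m * height_m)


def max_los_distance(h_t, h_r, radius_m=R_EARTH):
    """Maximum line-of-sight distance between two antennas, Eq. (15.1)."""
    return radio_horizon(h_t, radius_m) + radio_horizon(h_r, radius_m)


d_m = max_los_distance(32.0, 50.0)
print(f"{d_m / 1000:.1f} km")  # prints 45.5 km
```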


15.7 Modulation and its Necessity

As already mentioned, the purpose of a communication system is to 
transmit information or message signals. Message signals are also called 
baseband signals, which essentially designate the band of frequencies 
representing the original signal, as delivered by the source of information. 
No signal, in general, is a single frequency sinusoid, but it spreads over a 
range of frequencies called the signal bandwidth. Suppose we wish to 
transmit an electronic signal in the audio frequency (AF) range 
(baseband signal frequency less than 20 kHz) over a long distance 
directly. Let us find what factors prevent us from doing so and how we
overcome these factors.


15.7.1 SIZE OF THE ANTENNA OR AERIAL 

For transmitting a signal, we need an antenna or an aerial. This antenna
should have a size comparable to the wavelength of the signal (at least
$\lambda/4$ in dimension) so that the antenna properly senses the time variation
of the signal. For an electromagnetic wave of frequency 20 kHz, the
wavelength $\lambda$ is 15 km. Obviously, such a long antenna is not possible to
construct and operate. Hence direct transmission of such baseband
signals is not practical. We can obtain transmission with reasonable
antenna lengths if transmission frequency is high (for example, if $\nu$ is 1
MHz, then $\lambda$ is 300 m). Therefore, there is a need of translating the
information contained in our original low frequency baseband signal into
high or radio frequencies before transmission.
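
The wavelengths quoted above follow from the relation wavelength = c / frequency. The sketch below reproduces them and the corresponding quarter-wavelength antenna size; it assumes the speed of light c = 3 × 10^8 m/s, and the function names are hypothetical.

```python
C_LIGHT = 3.0e8  # speed of electromagnetic waves in free space (m/s), assumed value


def wavelength(freq_hz):
    """Wavelength in metres of an em wave of the given frequency."""
    return C_LIGHT / freq_hz


def quarter_wave_length(freq_hz):
    """Minimum antenna dimension (a quarter of the wavelength) in metres."""
    return wavelength(freq_hz) / 4.0


for freq in (20e3, 1e6):  # 20 kHz baseband signal and 1 MHz carrier
    print(f"{freq:.0f} Hz: wavelength = {wavelength(freq):.0f} m, "
          f"quarter wave = {quarter_wave_length(freq):.0f} m")
```

Even the minimum quarter-wavelength dimension at 20 kHz is several kilometres, whereas at 1 MHz it is a tower of practical height.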

15.7.2 EFFECTIVE POWER RADIATED BY AN ANTENNA 

A theoretical study of radiation from a linear antenna (length l) shows that
the power radiated is proportional to $(l/\lambda)^2$. This implies that for the same
antenna length, the power radiated increases with decreasing $\lambda$, i.e.,
increasing frequency. Hence, the effective power radiated by a long
wavelength baseband signal would be small. For a good transmission,
we need high powers and hence this also points to the need of using
high frequency transmission.
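
To get a feel for this dependence, the sketch below compares the power radiated by the same antenna at two wavelengths. Only the proportionality to (l/λ)² is taken from the text; the constant of proportionality is left out because it cancels in the ratio, and the 75 m antenna length is a hypothetical value chosen for illustration.

```python
def relative_radiated_power(length_m, wavelength_m):
    """Quantity proportional to the radiated power: (l / wavelength) squared."""
    return (length_m / wavelength_m) ** 2


antenna_length = 75.0  # hypothetical antenna length in metres

# Same antenna used at 1 MHz (wavelength 300 m) and at 20 kHz (wavelength 15 km).
power_ratio = (relative_radiated_power(antenna_length, 300.0)
               / relative_radiated_power(antenna_length, 15000.0))
print("Power at 1 MHz relative to power at 20 kHz:", power_ratio)
```

Under these assumptions the antenna radiates 2500 times more power at 1 MHz than at 20 kHz, since the ratio is (15000/300)².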

15.7.3 MIXING UP OF SIGNALS FROM DIFFERENT TRANSMITTERS

Another important argument against transmitting baseband signals
directly is more practical in nature. Suppose many people are talking at
the same time or many transmitters are transmitting baseband
information signals simultaneously. All these signals will get mixed up
and there is no simple way to distinguish between them. This points
towards a possible solution: using communication at high frequencies
and allotting a band of frequencies to each message signal for its
transmission.
Figure 15.7 (a) Sinusoidal, and (b) pulse shaped signals.

The above arguments suggest that there is a need for translating the 
original low frequency baseband message or information signal into high 
frequency wave before transmission such that the translated signal 
continues to possess the information contained in the original signal. In 
doing so, we take the help of a high frequency signal, known as the 
carrier wave, and a process known as modulation which attaches 
information to it. The carrier wave may be continuous (sinusoidal) or in 
the form of pulses as shown in Fig. 15.7. 

A sinusoidal carrier wave can be represented as

$$c(t) = A_c \sin(\omega_c t + \phi) \qquad (15.2)$$

where c(t) is the signal strength (voltage or current), $A_c$ is the amplitude,
$\omega_c$ ($= 2\pi\nu_c$) is the angular frequency and $\phi$ is the initial phase of the
carrier wave. During the process of modulation, any of the three
parameters, viz. $A_c$, $\omega_c$ and $\phi$, of the carrier wave can be controlled by the
message or information signal. This results in three types of modulation:
(i) Amplitude modulation (AM), (ii) Frequency modulation (FM) and (iii)
Phase modulation (PM), as shown in Fig. 15.8.
Figure 15.8 Modulation of a carrier wave: (a) a sinusoidal carrier wave; (b) a modulating signal; (c)
amplitude modulation; (d) frequency modulation; and (e) phase modulation.


Similarly, the significant characteristics of a pulse are: pulse amplitude,
pulse duration or pulse width, and pulse position (denoting the time of
rise or fall of the pulse amplitude) as shown in Fig. 15.7(b). Hence,
different types of pulse modulation are: (a) pulse amplitude modulation
(PAM), (b) pulse duration modulation (PDM) or pulse width modulation
(PWM), and (c) pulse position modulation (PPM). In this chapter, we shall
confine ourselves to amplitude modulation only.


15.8 Amplitude Modulation 

In amplitude modulation the amplitude of the carrier is varied in 
accordance with the information signal. Here we explain amplitude 
modulation process using a sinusoidal signal as the modulating signal. 

Let $c(t) = A_c \sin \omega_c t$ represent the carrier wave and $m(t) = A_m \sin \omega_m t$
represent the message or the modulating signal, where $\omega_m = 2\pi f_m$ is the
angular frequency of the message signal. The modulated signal $c_m(t)$
can be written as

$$c_m(t) = (A_c + A_m \sin \omega_m t) \sin \omega_c t = A_c \left(1 + \frac{A_m}{A_c} \sin \omega_m t\right) \sin \omega_c t \qquad (15.3)$$

Note that the modulated signal now contains the message signal. This
can also be seen from Fig. 15.8(c). From Eq. (15.3), we can write

$$c_m(t) = A_c \sin \omega_c t + \mu A_c \sin \omega_m t \, \sin \omega_c t \qquad (15.4)$$

Here $\mu = A_m/A_c$ is the modulation index; in practice, $\mu$ is kept $\le 1$ to
avoid distortion.


Using the trigonometric relation $\sin A \sin B = \frac{1}{2}[\cos(A - B) - \cos(A + B)]$,
we can write $c_m(t)$ of Eq. (15.4) as

$$c_m(t) = A_c \sin \omega_c t + \frac{\mu A_c}{2} \cos(\omega_c - \omega_m)t - \frac{\mu A_c}{2} \cos(\omega_c + \omega_m)t \qquad (15.5)$$

Here $\omega_c - \omega_m$ and $\omega_c + \omega_m$ are respectively called the lower side and
upper side frequencies. The modulated signal now consists of the carrier
wave of frequency $\omega_c$ plus two sinusoidal waves, each with a frequency
slightly different from $\omega_c$, known as side bands. The frequency spectrum of
the amplitude modulated signal is shown in Fig. 15.9.
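
The equivalence of the product form of Eq. (15.3) and the carrier-plus-side-frequency form of Eq. (15.5) can be checked numerically. The sketch below uses illustrative values (a 1 MHz carrier, a 10 kHz message and a modulation index of 0.5); the function names are hypothetical.

```python
import math


def am_product_form(t, a_c, mu, w_c, w_m):
    """AM wave as in Eq. (15.3): A_c (1 + mu sin(w_m t)) sin(w_c t)."""
    return a_c * (1.0 + mu * math.sin(w_m * t)) * math.sin(w_c * t)


def am_sideband_form(t, a_c, mu, w_c, w_m):
    """AM wave as in Eq. (15.5): carrier plus lower and upper side frequencies."""
    return (a_c * math.sin(w_c * t)
            + 0.5 * mu * a_c * math.cos((w_c - w_m) * t)
            - 0.5 * mu * a_c * math.cos((w_c + w_m) * t))


a_c, mu = 20.0, 0.5                  # illustrative carrier amplitude and index
w_c = 2.0 * math.pi * 1.0e6          # carrier angular frequency (1 MHz)
w_m = 2.0 * math.pi * 10.0e3         # message angular frequency (10 kHz)

# Compare the two forms at 1000 instants spread over 1 ms.
max_diff = max(
    abs(am_product_form(k * 1.0e-6, a_c, mu, w_c, w_m)
        - am_sideband_form(k * 1.0e-6, a_c, mu, w_c, w_m))
    for k in range(1000)
)
print("largest difference between the two forms:", max_diff)
```

The difference is at the level of floating-point rounding, confirming that the modulated wave contains only the frequencies ω_c, ω_c − ω_m and ω_c + ω_m.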



As long as the broadcast frequencies (carrier waves) are sufficiently 
spaced out so that sidebands do not overlap, different stations can 
operate without interfering with each other. 


Example 15.2 A message signal of frequency 10 kHz and peak voltage of 10 volts is used to 
modulate a carrier of frequency 1 MHz and peak voltage of 20 volts. Determine (a) 
modulation index, (b) the side bands produced. 

Solution 

(a) Modulation index = 10/20 = 0.5

(b) The side bands are at (1000 + 10) kHz = 1010 kHz and
(1000 - 10) kHz = 990 kHz.
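
The same calculation can be written as a short sketch; the function names are hypothetical, and the frequencies are kept in kHz as in the example.

```python
def modulation_index(a_m, a_c):
    """Ratio of message peak voltage to carrier peak voltage."""
    return a_m / a_c


def side_frequencies(f_c, f_m):
    """Lower and upper side frequencies of an AM wave (same units as the inputs)."""
    return f_c - f_m, f_c + f_m


mu = modulation_index(10.0, 20.0)             # peak voltages in volts
lower, upper = side_frequencies(1000.0, 10.0)  # carrier 1 MHz = 1000 kHz, message 10 kHz
print("modulation index:", mu)
print("side bands at", lower, "kHz and", upper, "kHz")
```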


15.9 Production of Amplitude Modulated Wave 
Figure 15.10 Block diagram of a simple modulator
for obtaining an AM signal.

Amplitude modulation can be produced by a variety of methods. A 
conceptually simple method is shown in the block diagram of Fig. 15.10. 

Here the modulating signal $A_m \sin \omega_m t$ is added to the carrier signal
$A_c \sin \omega_c t$ to produce the signal x(t).

This signal $x(t) = A_m \sin \omega_m t + A_c \sin \omega_c t$ is passed through a square law
device, which is a non-linear device that produces an output

$$y(t) = B x(t) + C x^2(t) \qquad (15.6)$$

where B and C are constants. Thus,

$$y(t) = B A_m \sin \omega_m t + B A_c \sin \omega_c t + C \left[ A_m^2 \sin^2 \omega_m t + A_c^2 \sin^2 \omega_c t + 2 A_m A_c \sin \omega_m t \, \sin \omega_c t \right] \qquad (15.7)$$

$$= B A_m \sin \omega_m t + B A_c \sin \omega_c t + \frac{C A_m^2}{2} + \frac{C A_c^2}{2} - \frac{C A_m^2}{2} \cos 2\omega_m t - \frac{C A_c^2}{2} \cos 2\omega_c t + C A_m A_c \cos(\omega_c - \omega_m)t - C A_m A_c \cos(\omega_c + \omega_m)t \qquad (15.8)$$

where the trigonometric relation $\sin^2 A = (1 - \cos 2A)/2$ and the relation
for sin A sin B mentioned earlier are used.














In Eq. (15.8), there is a dc term $\frac{C}{2}(A_m^2 + A_c^2)$ and sinusoids of
frequencies $\omega_m$, $2\omega_m$, $\omega_c$, $2\omega_c$, $\omega_c - \omega_m$ and $\omega_c + \omega_m$. As shown in Fig.
15.10, this signal is passed through a band pass filter* which rejects dc
and the sinusoids of frequencies $\omega_m$, $2\omega_m$ and $2\omega_c$ and retains the
frequencies $\omega_c$, $\omega_c - \omega_m$ and $\omega_c + \omega_m$. The output of the band pass filter
therefore is of the same form as Eq. (15.5) and is therefore an AM wave.
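
The expansion of the square law output can be verified numerically, and the terms retained by the filter can be compared with the AM form of Eq. (15.5). The sketch below is an illustration under stated assumptions: the constants B, C, the amplitudes and the frequencies are hypothetical values, and the band pass filter is treated as ideal, that is, it simply keeps the three terms near the carrier frequency.

```python
import math


def square_law_output(t, a_m, a_c, w_m, w_c, b, c):
    """Output y = B x + C x^2 of the square law device, Eq. (15.6)."""
    x = a_m * math.sin(w_m * t) + a_c * math.sin(w_c * t)
    return b * x + c * x * x


def expanded_output(t, a_m, a_c, w_m, w_c, b, c):
    """The same output written term by term, as in Eq. (15.8)."""
    return (b * a_m * math.sin(w_m * t) + b * a_c * math.sin(w_c * t)
            + 0.5 * c * (a_m ** 2 + a_c ** 2)           # dc term
            - 0.5 * c * a_m ** 2 * math.cos(2 * w_m * t)
            - 0.5 * c * a_c ** 2 * math.cos(2 * w_c * t)
            + c * a_m * a_c * math.cos((w_c - w_m) * t)
            - c * a_m * a_c * math.cos((w_c + w_m) * t))


def band_pass_output(t, a_m, a_c, w_m, w_c, b, c):
    """Terms an ideal band pass filter centred on w_c would retain."""
    return (b * a_c * math.sin(w_c * t)
            + c * a_m * a_c * math.cos((w_c - w_m) * t)
            - c * a_m * a_c * math.cos((w_c + w_m) * t))


# Hypothetical values chosen only for illustration.
a_m, a_c, b, c = 1.0, 2.0, 1.0, 0.1
w_m, w_c = 2 * math.pi * 1.0e3, 2 * math.pi * 100.0e3
times = [k * 1.0e-6 for k in range(2000)]

max_expansion_error = max(
    abs(square_law_output(t, a_m, a_c, w_m, w_c, b, c)
        - expanded_output(t, a_m, a_c, w_m, w_c, b, c)) for t in times)

# Comparing the retained terms with Eq. (15.5) gives a carrier amplitude
# B*A_c and an effective modulation index 2*C*A_m/B.
amp, mu_eff = b * a_c, 2.0 * c * a_m / b
max_am_error = max(
    abs(band_pass_output(t, a_m, a_c, w_m, w_c, b, c)
        - amp * (1.0 + mu_eff * math.sin(w_m * t)) * math.sin(w_c * t))
    for t in times)

print(max_expansion_error, max_am_error, mu_eff)
```

Both differences are at the level of rounding error, so the filtered output is an AM wave whose effective modulation index depends on the device constants B and C as well as on the message amplitude.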
Figure 15.11 Block diagram of a transmitter.


It is to be mentioned that the modulated signal cannot be transmitted as 
such. The modulator is to be followed by a power amplifier which 
provides the necessary power and then the modulated signal is fed to an 
antenna of appropriate size for radiation as shown in Fig. 15.11. 


15.10 Detection of Amplitude Modulated Wave 

The transmitted message gets attenuated in propagating through the 
channel. The receiving antenna is therefore to be followed by an amplifier 
and a detector. In addition, to facilitate further processing, the carrier 
frequency is usually changed to a lower frequency by what is called an 
intermediate frequency (IF) stage preceding the detection. The detected 
signal may not be strong enough to be made use of and hence is 
required to be amplified. A block diagram of a typical receiver is shown in 
Fig. 15.12.











Figure 15.12 Block diagram of a receiver. 


Detection is the process of recovering the modulating signal from the
modulated carrier wave. We just saw that the modulated carrier wave
contains the frequencies $\omega_c$ and $\omega_c \pm \omega_m$. In order to obtain the original
message signal m(t) of angular frequency $\omega_m$, a simple method is shown
in the form of a block diagram in Fig. 15.13.



Figure 15.13 Block diagram of a detector for AM signal: (a) AM input wave, (b) rectified wave,
(c) output (without RF component). The quantity on y-axis can be current or voltage.

The modulated signal of the form given in (a) of Fig. 15.13 is passed
through a rectifier to produce the output shown in (b). The envelope of
signal (b) is the message signal. In order to retrieve m(t), the signal is
passed through an envelope detector (which may consist of a simple RC
circuit).
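
The two detection steps can be simulated in a few lines. This is a minimal sketch under stated assumptions, not a circuit design: the rectifier is modelled as an ideal half-wave rectifier, the envelope detector as an ideal diode charging a capacitor that discharges through a resistor with time constant RC, and all numerical values (100 kHz carrier, 1 kHz message, RC = 0.1 ms) are hypothetical. RC is chosen much larger than the carrier period and much smaller than the message period.

```python
import math

A_C, MU = 1.0, 0.5            # carrier amplitude and modulation index (assumed)
F_C, F_M = 100.0e3, 1.0e3     # carrier and message frequencies in Hz (assumed)
FS = 5.0e6                    # sampling rate used for the simulation
RC = 1.0e-4                   # detector time constant in seconds (assumed)

dt = 1.0 / FS
decay = math.exp(-dt / RC)    # capacitor discharge factor per time step
n_samples = round(2.0e-3 * FS)  # simulate two message periods

recovered, envelope = [], []
v = 0.0                       # capacitor voltage
for n in range(n_samples):
    t = n * dt
    env = A_C * (1.0 + MU * math.sin(2 * math.pi * F_M * t))  # true envelope
    am = env * math.sin(2 * math.pi * F_C * t)                # AM input wave
    rectified = max(am, 0.0)                                  # half-wave rectifier
    v = max(rectified, v * decay)  # charge through the diode, else discharge via R
    recovered.append(v)
    envelope.append(env)

# Ignore the first carrier cycle, during which the capacitor is still charging.
skip = round(FS / F_C)
mean_err = (sum(abs(r - e) for r, e in zip(recovered[skip:], envelope[skip:]))
            / (n_samples - skip))
print("mean deviation of detector output from the true envelope:", round(mean_err, 3))
```

The detector output follows the envelope with a small ripple at the carrier frequency. Subtracting the constant level A_c from the envelope leaves the message signal m(t).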

In the present chapter we have discussed some basic concepts of
communication and communication systems. We have also discussed
one specific type of analog modulation, namely Amplitude Modulation
(AM). Other forms of modulation and digital communication systems play
an important role in modern communication. These and other exciting
developments are taking place every day.

So far we have restricted our discussion to some basic communication 
systems. Before we conclude this chapter, it is worth taking a glance at 
some of the communication systems (see the box) that in recent times 
have brought major changes in the way we exchange information even in 
our day-to-day life: 


ADDITIONAL INFORMATION 

THE INTERNET 

It is a system with billions of users worldwide. It permits communication and sharing of all 
types of information between any two or more computers connected through a large and 
complex network. It was started in 1960’s and opened for public use in 1990’s. With the 
passage of time it has witnessed tremendous growth and it is still expanding its reach. Its 
applications include 

(i) E-mail - It permits exchange of text/graphic material using email software. We can write a
letter and send it to the recipient through ISPs (Internet Service Providers) who work like the
dispatching and receiving post offices.

(ii) File transfer - FTP (File Transfer Protocol) programmes allow transfer of files/software from one
computer to another connected to the Internet.

(iii) World Wide Web (WWW) - Computers that store specific information for sharing with
others provide websites either directly or through web service providers. Government
departments, companies, NGOs (Non-Government Organisations) and individuals can post
information about their activities for restricted or free use on their websites. This information
becomes accessible to the users. Several search engines like Google, Yahoo! etc., help us in
finding information by listing the related websites. Hypertext is a powerful feature of the web
that automatically links relevant information from one page on the web to another using HTML
(hypertext markup language).

(iv) E-commerce - Use of the Internet to promote business using electronic means such as 
using credit cards is called E-commerce. Customers view images and receive all the 
information about various products or services of companies through their websites. They 
can do on-line shopping from home/office. Goods are dispatched or services are provided by 
the company through mail/courier. 

(v) Chat - Real time conversation among people with common interests through typed 
messages is called chat. Everyone belonging to the chat group gets the message 
instantaneously and can respond rapidly. 

FACSIMILE (FAX) 

It scans the contents of a document (as an image, not text) to create electronic signals. 
These signals are then sent to the destination (another FAX machine) in an orderly manner 
using telephone lines. At the destination, the signals are reconverted into a replica of the 
original document. Note that FAX provides image of a static document unlike the image 
provided by television of objects that might be dynamic. 


MOBILE TELEPHONY 

The concept of mobile telephony was developed first in 1970’s and it was fully implemented
in the following decade. The central concept of this system is to divide the service area into a
suitable number of cells centred on an office called MTSO (Mobile Telephone Switching
Office). Each cell contains a low-power transmitter called a base station and caters to a large
number of mobile receivers (popularly called cell phones). Each cell could have a service
area of a few square kilometers or even less depending upon the number of customers.

When a mobile receiver crosses the coverage area of one base station, it is necessary for the
mobile user to be transferred to another base station. This procedure is
called handover or handoff. This process is carried out very rapidly, to the extent that the
consumer does not even notice it. Mobile telephones operate typically in the UHF range of
frequencies (about 800-950 MHz).


Summary 

1. Electronic communication refers to the faithful transfer of information or message 
(available in the form of electrical voltage and current) from one point to another point. 

2. Transmitter, transmission channel and receiver are three basic units of a communication 
system. 

3. Two important forms of communication system are: Analog and Digital. The information to 
be transmitted is generally in continuous waveform for the former while for the latter it has 
only discrete or quantised levels. 

4. Every message signal occupies a range of frequencies. The bandwidth of a message 
signal refers to the band of frequencies, which are necessary for satisfactory transmission of 
the information contained in the signal. Similarly, any practical communication system permits 
transmission of a range of frequencies only, which is referred to as the bandwidth of the 
system. 

5. Low frequencies cannot be transmitted to long distances. Therefore, they are 
superimposed on a high frequency carrier signal by a process known as modulation. 

6. In modulation, some characteristic of the carrier signal like amplitude, frequency or phase 
varies in accordance with the modulating or message signal. Correspondingly, they are called 
Amplitude Modulated (AM), Frequency Modulated (FM) or Phase Modulated (PM) waves. 


7. Pulse modulation could be classified as: Pulse Amplitude Modulation (PAM), Pulse
Duration Modulation (PDM) or Pulse Width Modulation (PWM) and Pulse Position Modulation
(PPM).

8. For transmission over long distances, signals are radiated into space using devices called 
antennas. The radiated signals propagate as electromagnetic waves and the mode of 
propagation is influenced by the presence of the earth and its atmosphere. Near the surface 
of the earth, electromagnetic waves propagate as surface waves. Surface wave propagation 
is useful up to a few MHz frequencies. 

9. Long distance communication between two points on the earth is achieved through 
reflection of electromagnetic waves by ionosphere. Such waves are called sky waves. Sky 
wave propagation takes place up to frequency of about 30 MHz. Above this frequency, 
electromagnetic waves essentially propagate as space waves. Space waves are used for 
line-of-sight communication and satellite communication. 

10. If an antenna radiates electromagnetic waves from a height $h_T$, then the range $d_T$ is
given by $\sqrt{2Rh_T}$, where R is the radius of the earth.

11. Amplitude modulated signal contains frequencies ($\omega_c - \omega_m$), $\omega_c$ and ($\omega_c + \omega_m$).

12. Amplitude modulated waves can be produced by application of the message signal and 
the carrier wave to a non-linear device, followed by a band pass filter. 

13. AM detection, which is the process of recovering the modulating signal from an AM 
waveform, is carried out using a rectifier and an envelope detector. 


Points to Ponder 

1. In the process of transmission of message/ information signal, noise gets added to the 
signal anywhere between the information source and the receiving end. Can you think of 
some sources of noise? 

2. In the process of modulation, new frequencies called sidebands are generated on either
side (higher and lower than the carrier frequency) of the carrier by an amount equal to the
highest modulating frequency. Is it possible to retrieve the message by transmitting (a) only
the side bands, (b) only one side band?

3. In amplitude modulation, modulation index $\mu \le 1$ is used. What will happen if $\mu > 1$?


Exercises 


15.1 Which of the following frequencies will be suitable for
beyond-the-horizon communication using sky waves?

(a) 10 kHz 

(b) 10 MHz 

(c) 1 GHz 

(d) 1000 GHz 

15.2 Frequencies in the UHF range normally propagate by means of: 

(a) Ground waves. 

(b) Sky waves. 

(c) Surface waves. 

(d) Space waves. 

15.3 Digital signals 

(i) do not provide a continuous set of values, 

(ii) represent values as discrete steps, 

(iii) can utilize binary system, and 

(iv) can utilize decimal as well as binary systems. 









Which of the above statements are true? 

(a) (i) and (ii) only 

(b) (ii) and (iii) only 

(c) (i), (ii) and (iii) but not (iv) 

(d) All of (i), (ii), (iii) and (iv). 

15.4 Is it necessary for a transmitting antenna to be at the same 
height as that of the receiving antenna for line-of-sight 
communication? A TV transmitting antenna is 81m tall. How much 
service area can it cover if the receiving antenna is at the ground 
level? 

15.5 A carrier wave of peak voltage 12V is used to transmit a 
message signal. What should be the peak voltage of the modulating 
signal in order to have a modulation index of 75%? 

15.6 For an amplitude modulated wave, the maximum amplitude is
found to be 10V while the minimum amplitude is found to be 2V.
Determine the modulation index, $\mu$.

What would be the value of $\mu$ if the minimum amplitude is zero volt?

15.7 Due to economic reasons, only the upper sideband of an AM 
wave is transmitted, but at the receiving station, there is a facility for 
generating the carrier. Show that if a device is available which can 
multiply two signals, then it is possible to recover the modulating 
signal at the receiver station. 

15.8 A modulating signal is a square wave, as shown in Fig. 15.14. 
Figure 15.14

The carrier wave is given by $c(t) = 2\sin(8\pi t)$ volts.

(i) Sketch the amplitude modulated waveform 

(ii) What is the modulation index? 